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EDITORIAL PREFACE 


The editors of the Applied Logic Series are pleased to present the second volume 
in the series, this one on Proof Theory and Modal Logic. In recent years the 
topic of multi-modal logics has had significant applications in a variety of 
disciplines, including Artificial Intelligence, theoretical computer science, 
philosophy, linguistics, Peano arithmetic, and generalized quantifiers. In this 


volume, leading researchers examine the proof theory of this rich and active 
field. 
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PREFACE 


This book deals with formal, mechanizable reasoning in modal logics, that is log- 
ics of necessity, possibility, belief, time, computations etc. It is therefore of vig- 
orous interest for various interrelated disciplines like Philosophy, Artificial Intelli- 
gence, Computer Science, Logic, Cognitive Science and Linguistics. Proof Theory 
of Modal Logic is the first book focusing entirely on matters of modal proof theory 
and for the first time brings together various directions in the development of gen- 
eralized sequent-style proof systems. 


Though early this century modal logic started as a purely syntactic enterprise, 
modal proof theory did not undergo such a tremendous development as was expe- 
rienced by modal model theory after the emergence of possible worlds semantics 
around 1960. In the 1980s several distinguished modal logicians explicitly noted 
this discrepancy and since that time, various generalizations of the hitherto standard 
proof-theoretic formalisms have been developed and investigated. Suggestions for 
generalizing the ordinary notion of sequent have for instance led to higher-level, 
higher-arity, and higher-dimensional Gentzen-type proof systems for modal logics. 
Moreover, modal correspondence theory has been applied in order to use theorem 
provers for classical predicate logic. General proof-theoretic frameworks have also 
been devised, like Nuel Belnap’s Display Logic and Dov Gabbay’s theory of La- 
belled Deductive Systems. This lively development is still ongoing. 


Proof Theory of Modal Logic is the proceedings volume of a workshop on the 
proof theory of modal logic, held at the University of Hamburg on November 19— 
20, 1993. The aim of this workshop was to bring together various, up to then, more 
or less unrelated direetions of research in modal proof theory and thereby to stim- 
ulate significant progress in the field. The book consists of 15 research papers, di- 
vided into three parts. In the first part, papers are collected which give a profound 
description of powerful proof-theoretic methods as applied to the normal modal 
logic S4. Part II is devoted to generalizations of the standard proof-theoretic for- 
mats. This part contains papers either dealing with or related to (i) relational proof 
systems, (ii) Display Logic, or, more generally, approaches incorporating modal 
operators as structural elements, (111) higher-level and higher-dimensional proof sys- 
tems, and (iv) axiomatic presentations using the ‘irreflexivity-rule’. The third part 
presents new and important results on semantics-based proof systems for modal 
logics. 


The workshop culminating in the present volume was generously sponsored by 
the Stiftung Volkswagenwerk (Hannover). I would like to express my 
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gratitude to the Volkswagen Foundation for making the workshop possible and to 
Christopher Habel and the graduate programme in Cognitive Science 
(Graduiertenkolleg Kognitionswissenschaft) at the University of Hamburg for their 
additional support. I am grateful to Annette Tschernig for her assistance during the 
workshop. Special thanks go to (1) Dov Gabbay for supporting the publication of 
this book, (ii) Jane Spurr for preparing the final manuscript, and (iii) the colleagues 
who acted as referees in the process of preparing this manuscript, namely: 


Patrick Blackburn Makoto Kanazawa Luiz C. Pereira 


Valentin Goranko Marcus Kracht Krister Segerberg 

Rajeev Goré Joachim Lambek Valentin Shehtman 

Wiebe van der Hoek Andrea Masini Max Urchs 

Jan Jaspars Grigori Mints Richard Zach 
Heinrich Wansing 


Leipzig, September 1995 


PART I 


STANDARD PROOF SYSTEMS 


JORG HUDELMAIER 


A CONTRACTION-FREE SEQUENT CALCULUS 
FOR 54 


Theorem proving in the modal logic S4 is notoriously difficult, because in 
conventional sequent style calculi for this logic lengths of deductions are not 
bounded in terms of the length of their endsequent. This means that the usual depth 
first search strategy for backwards construction of deductions of given sequents 
may give rise to infinite search paths and is not guaranteed to terminate. Thus using 
such a search strategy prevents us not only from obtaining a decision procedure for 
the logic in question, but even from arriving at a complete proof procedure. There 
are two well known approaches for overcoming this problem: both approaches rely 
on the fact that all formulas which occur in a deduction are subformulas of the end- 
sequent and that out of these formulas one may only form finitely many “essentially 
different" sequents. Thus although there is no bound on the length of all deductions 
of a given sequent, we know that for any given sequent there is a number such that 
if the sequent is deducible at all, then it has a deduction of length smaller than this 
number. Hence by only considering deductions of appropriately bounded length 
one may obtain a decision procedure. But due to the fact that using such a proce- 
dure one is forced to construct many redundant inferences—one will, for instance, 
have to consecutively apply the same inference many times—this approach is con- 
sidered rather inefficient. Instead the usual technique for deciding provability of 
formulas in S4 is based on loop checking: If “essentially the same” sequent oc- 
curs twice on a branch of a constructed deduction, then there is a shorter deduction 
with the same endsequent which does not show this redundancy, and one may back- 
track. Although more efficient in terms of run time than the previous approach this 
loop checking method requires quite involved implementation techniques. Now in 
the context of intuitionistic propositional logic recently a third approach has been 
found, which is based on so called contraction free sequent calculi, i.e. calculi for 
which there is a certain measure such that for all rules of the calculi all premisses 
have smaller measure then the conclusion (cf. [1, 3]). Thus in a contraction free 
calculus all deductions of a given sequent are bounded in length by some function 
of the length of their endsequent and a decision procedure is obtained by simple 
depth first backwards application of the rules. In this paper we show that there is 
a contraction free sequent calculus for S4, too. This calculus is more complicated 
than the corresponding calculi for intuitionistic propositional logic, but still it gives 
rise to a simpler decision procedure for S4 than conventional methods. 
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1 INTRODUCTION 


We consider a language of sequents, i.e. pairs of multisets of formulas built up from 
propositional variables, the Boolean connectives ^ and V and the modal connective 
[]. 

We start from a calculus LMo for the modal logic S4 consisting of axioms of the 
form M,a = a, N, where a is a propositional variable, the well known Boolean 
rules 


M > N,v I M,v>N 

M,-v>N M > N,-v 

M,u> N M,v—N M > N,u,v 
E V ——————————————— I 


M,uvv>N M >N,uVvv 


and the two modal rules 


M,Ov, v > N IH M? >v 
M,Ov > N M > N,Uv 


where M? results from M by omitting all formulas not of the form Ov. 

Obviously the so called weakening rule is an admissible rule of this calculus: 
If a sequent M = N is deducible in LMo by a deduction of length n, then both 
the sequent M => N,v and the sequent M, v => N are deducible by deductions of 
length < n. Moreover it is immediately clear that the Boolean rules of LMo and the 
rule EC! are invertible: If a conclusion of one of these rules is deducible in LM, by 
a deduction of length n, then all its premisses are deducible by deductions of length 
< n. Therefore the following holds: 


LEMMA 1 a) Every LMo-deduction of a sequent M,v,v = N may be trans- 
formed into a deduction of the sequent M ,v => N of smaller or equal length. 

b) Every LMo-deduction of a sequent M => v,v, N may be transformed into a 
deduction of the sequent M — v, N of smaller or equal length. 


Proof. a) is true of axioms, and if M,v,v => N is the conclusion of an inference 
I different from ID with principal formula different from v, then the induction hy- 
pothesis applies to the premisses and by an application of J to the transformed pre- 
misses the sequent M, v = N may be obtained. If M, v,v = N is the conclusion 
of an ICl-inference, then either both occurrences of v are contained in M?, in which 
case the induction hypothesis applies to it and M, v => N may be obtained as be- 
fore, or neither occurrence of v is in M? and one of them may be introduced by 
weakening. If M,v,v => N is the conclusion of a Boolean rule B with principal 
formula v, then the premisses may be transformed according to the inversion prin- 
ciple for the Boolean rules and the resulting sequents may be transformed by the 
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induction hypothesis; finally to these transformed sequents the rule B may be ap- 
plied again, thereby obtaining a deduction of M,v = N. If M,v,v => N is the 
conclusion of an ELl-inference with principal formula v, then v is of the form Ou 
and the premiss reads M, Ou, Ou, u = N; the rule ED applied to this gives the 
required deduction of M,v > N. 

b) is also true of axioms, and if M => v,v, N is the conclusion of an inference J 
different from ID, then the deduction of M = v, N is constructed as above. But 
if M => v,v, N is the conclusion of an ILl-inference, then from its premiss we di- 
rectly obtain M = v, N by a different application of IL] with the same principal 
formula, viz. by introducing only one occurrence of v into the conclusion. a 


This shows that the calculus LMo is equivalent to the more frequently encoun- 
tered S4-calculi for sequents made up of pairs of sets of formulas (cf.[2]), and in 
particular the so called cut rule is also a derived rule of LMo: if two sequents M => 
c and M,c => v are deducible by LMp, then so is the sequent M => v. 


2 REDUCTION TO CLAUSAL FORM 


In order to determine deducibility of sequents by our calculus LMo we may restrict 
the language to so called clausal sequents: there is a simple procedure which asso- 
ciates to every sequent s of the full language a clausal sequent C (s), such that s is 
derivable by LMo if and only if C(s) is derivable by LMo. 


DEFINITION 2 (Cf. [4]) a) A modal literal is either a propositional variable, a 
negated propositional variable, a formula of the form Qa, where a is a proposi- 
tional variable or a formula of the form ^Lla, where a is a propositional variable. 
b) A modal clause is a formula of the form low (ly V . ..) ...) orD(le Vl V...)...)), 
where the l; are modal literals. 

c) A clausal sequent is a sequent of the form c,,...,Cm => a4,...,0&, where the 
c, are modal clauses and the a; are propositional variables. 


For simplifying notation we let the expression [vo, .. ., v,](n > 0) denote the 
formula vp V v; V (... V v,)...). Thus a formula [vo, .. . , v4] or O[vo,..., Un] is 
a modal clause iff all the v; are modal literals. 

Then the following holds: 


LEMMA 3 The sequents in the first column of the following table are deducible by 
LMp if and only if the corresponding sequents of the second column are deducible: 
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M >v, N M, ~v > N 

M,[A,22v, B] > N M,[A,v, B] > N 

M, [A,~(u V v), B] > N M, [A, ^p. B], [p, ^u], [p, ^v] > N 
M,[A,u V v, B] > N M,[A, u,v, B] > N 

M,|A,-O-v, B] > N M, [A, -Op, B], Olp, v] > N 
M,|A,-O(uv v), B] > N M, [4, -Op, B], Olp, ^u], Ofp, ^v] > N 
M,[A, 200v, B] > N M,[A,-Qv, B] > N 

M, [A4,O-v, B] > N M, [A, Op, B], [^p, ^v] > N 
M,[A,O(uv v), B] > N M, [A, Op, B], O[-p, u, v] > N 

M, [4, 00v, B] > N M,[A,Qu, B] > N 

M,U[A,, 22v, B] > N M,O[A,v, B] > N 

M,U[A, ~(u v v), B] > N M,U[A, ^p, B], Ofp, ^u], O[p, ^v] > N 
M,U[A,u vv, B] 3 N M,U[A,u,v, B] > N 

M,U[A, -O-v, B] > N M,O/A, -Op, B], Olp, v] > N 
M,U[A,-Q(uVv),B| > N M,O[A,-Op, B], Ofp, ^u], Olp, ^v] > N 
M,O[A, -O0v, B] > N M,O/A,-Ov, B] > N 

M,O[A, Dv, B] > N M,O[A, Op, B], O[-p, ^v] > N 
M,O[A,O(uvv),B)>N M,O[A, Op, B),O[-p, u,v) 2 N 

M, O[4, 00v, B] > N M,U[A, Ov, B] > N 


(Here the formulas p on the right hand sides are propositional variables which do 
not occur on the corresponding left hand sides.) 


Proof. This lemma is well known: for instance from a deduction of 
M,O/[A,-O(u V v), B] 2 N we obtain the required deduction of M, O[A, 
-Up, B], Olp, ^u] G[p, ^v] 2 N by a cut with the LMo-deducible sequent O[A, 
-Up, B], O[p, ^u], Ofp, ^v] > Of[A,70(u V v), B], whereas given a deduction 
of M,O[A, -Op, B], Olp, ^u], Op, ^v] = N we obtain the required deduction 
of M,O[A, 2D(u V v), B] = N by changing all occurrences of p to u V v and 
cutting out the two LMo-decucible formulas Ofu V v, ^u] and Ofv V v, ^v] from 
the resulting endsequent. B 


From this lemma we easily obtain a procedure for reducing a given sequent of 
arbitrary form to a clausal sequent. This procedure consists of two steps: 
a) Applying the property expressed by the first row of the above table to the formu- 
las v; on the right hand side of a given sequent a1, m..., am, Éldi,..., 
Ody, => v1,..., Uq, Pr, ---, Pr (where the a; are not of the form Ob, the v; are not 
propositional variables and the p; are propositional variables), we arrive at a new 
sequent which has only propositional variables on its right hand side and which has 
at most twice as many connectives as the given sequent. 
b) Applying the remaining rows to the resulting sequent [a1], . . . , [am], Oldı], 
...,O[d,], [^vi]... [^94] => pi,.--,Pr it is easily seen, that at each step the 
number of connectives inside one of the square brackets of this sequent, which do 
not belong to modal literals, decreases. This number is bounded by the number of 
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all connectives of the sequent which is itself bounded by twice the number of con- 
nectives of the original sequent. Since at each step the number of connectives of a 
sequent is at most increased by 1, this shows the required 


LEMMA 4 There is a procedure which converts any sequent s into a clausal se- 
quent C(s) such that the number of connectives of C(s) is at most four times the 
number of connectives of s and s is deducible by LM, iff C(s) is deducible. 


Now unfortunately in an LMo-deduction of a clausal sequent there will usually 
occur sequents which aren't clausal, viz. sequents occurring as premisses of appli- 
cations of E~ and having a formula Da on their right hand side. But the following 
lemma shows that we may combine such applications of E~ with an immediately 
preceding application of ID into a single new rule, thus preserving clausal forms 
for all sequents of a deduction: 


LEMMA 5 There is a transformation sending every LMo-deduction of a given se- 
quent into a deduction of the same sequent such that in the new deduction every 
premiss of an application of E~ with principle formula, =Qv is the conclusion of 
an application of IO, with principle formula Dv. 


Proof. We consider a maximal application J of E~ with principal formula ~Ov 
and premiss s for which Ov is not the principal formula of the inference leading to 
s and we use recursion on the maximal number of successive sequents preceding 
s in which Uv occurs on the right hand side to turn it into an E--inference hav- 
ing the required property: If this number is 1, then either s is an axiom or Ov must 
have disappeared because of some application of IC] with principal formula differ- 
ent from Ov and in both cases the E—-inference J may be dropped. If this number is 
greater than 1 and the inference leading to s is either a Boolean inference with prin- 
cipal formula not of the form ~Ow or an EL-inference, then this inference may be 
shifted down past the E--inference, thereby lowering the recursion parameter. But 
if the inference leading to s is another E—-inference J with principal formula -Ow, 
then its premiss is the conclusion of an IL]-inference with principal formula Ow and 
again the inference J may be dropped. Using this technique we will eventually turn 
all E--inferences of a given deduction into inferences of the form required by this 
lemma. E 


Using this lemma we may now replace any pair of inferences consisting of an 
application of IL] with principal formula Ov and an application of E~ with principle 
formula ^L]v immediately following it by a new inference leading directly from 
the premiss of the ILJ-inference to the conclusion of the E--inference. This gives 
us the new calculus LM, which has the axioms and the rules ED) of LM; and the 
following divided E—-rule: 


M => a, N . M? > a 
^$—— —————— (a a propositional variable) E-2d————————— 
M,-a=>wN M,-Oa=>N 
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Now in an LM; -deduction of a clausal sequent there occur only clausal sequents, 
and the rules just mentioned are the only rules of LMo which are applicable to 
clausal sequents. Therefore since LM, shall only be used for clausal sequents, it 
does not need any further rules; in particular it does not need any I-rules for the 
following lemma to hold: 


LEMMA 6 The calculi LMo and LM, are equivalent as regards deducibility of 
clausal sequents. 


Therefore the rules E^s and EV are invertible rules of this calculus, too, and 
we may safely extend the EV-rule to disjunctions of arbitrary length, showing the 
following: 


LEMMA 7 The following calculus LM» is equivalent to LM: LMg has the ax- 
ioms and the rule Ed of LM, and it has the rules EV and ED in the form 


M,a >N ... Ma — N 

M > bı, N ... M > ba, N 

M,Q¢q >N ... M,Qc, > N 
M,-OQd,>N ... M,7~Qd, > N 
EV — — M5 N  ——— 


M,OQv,ay >N ... M,Ov,ap > N 
MUūv > bı, N ... Mv > b, N 
M,Uv,Uci > N ... M,Qv,Qc, > N 
: M,Uv,-Ud; > N ... M,Qv,7-Od, > N 


E ‚Uv => 


where v is the formula |a,...,@p,7b1,...,7bg,Oc,...,0c,,7Od,,..., 
Od], the ai, bj, c and d, are propositional variables, p + q +r + s > Oandv 
is meant to represent any permutation of its subformulas. 


3 THE CALCULUS 


Here we note, that the rule E~s of LM; is a special case of the rule EV and that due 
to the invertibility of the E-s-rule the formulas b; in the rules EV and EO may be 
put to the right hand side of the premisses in the second row, whereas the formu- 
las ~d; have to remain on the left hand side, because the rule E-d is not directly 
invertible. Still we show in the next lemma that any LM2-deduction may be trans- 
formed in such a way that these formulas can immediately be put to the right hand 
sind, too. For this purpose we call the premisses in the first and second rows of the 
EV- and EC-rules a-premisses, those in the third row 3-premisses and those in the 
fourth row y-premisses. 
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LEMMA S There is a transformation sending any LM»-deduction of a given se- 
quent into another deduction of the same sequent, such that in the new deduction 
every ^y-premiss is the conclusion of an application of Ed. 


Proof. In order to construct such a deduction from a given arbitrary deduction we 
consider a maximal *-premiss s = M,QOv,-Od => N of an inference J of our de- 
duction, which isn't the conclusion of an application of E-d (here v = [a,..., dp, 
ab;,...,7bg,0e1,...,Oc,, ^Dd;,..., 0ds, ^UOd]), and we use recursion on 
the maximal number of successive sequents preceding s in which ^Lld occurs: if it 
is 0, then either s is an axiom, or it is the conlcusion of an applciation of E-d with 
principal formula different from —L1d. In both cases the inference whose premiss s 
is may be dropped. If this number is greater than 0, and the inference J leading to 
s is an application of E~s, then we may shift J down past J in the usual way and 
lower the recursion parameter. But if J is by an application of a multipremiss rule, 
e.g. EL), then we proceed as follows: 

Let w be [ej,...,e5, ^fi... fu Ug... , Dg», ^OR,,..., Dhu], let 
M be L, Ow and let L,Ow,0Ov => N be the conclusion of J, let L, Ow, Ov, 
a; > N and L,Ow, Ov => bj, N be its a-premisses, L, Ow, Ov, Oc => N its 
B-premisses and L, Ow, Ov, -Od; => N and s its y-premisses: then the premisses 
of J are of the form L,Ow,Ov,7-Od,ey = N resp. L,Ow,Ov,-Od 
=> fp, N resp. L, Ow, Ov, -Od,Og, = N resp. L, Ow, Dv, 70d, Dh => 
N. Now since J was a maximal inference not obeying the lemma, the sequents 
L,Ow,Ov,-7-Od,-0hy = N are immediately preceded by sequents L°, Ow, 
Ov => by. Thus we may shift the J-inference below the J-inference: 

From the /-premisses different from s we obtain by weakening sequents 
L,Ow, Ov, a; ep => N, sequents L, Ow, Ov, ep =,6;N, sequents L, Ow, Ov, 
Olc;,,e; => N and sequents L, Ow, Ov, 2Cld;, e; = N and we combine these 
with the J-premiss L, Ow, Ov, -Od,e; = N to form the premisses of an ED- 
inference leading to L, Ow, Ov, e; = N. Similar ED-inferences provide us with 
the sequents L,Ow,Ov => f;, N and L, Ow, Ov, Og = N, and combining 
these with the sequents L, Dw, Ov, -Ohy = N which result from L^, Ow, Ov > 
by we obtain all th premisses of an E[T-inference leading to L,Ow,Ov > N, 
i.e. the conclusion of J. Now in the new deduction the recursion parameter has 
decreased and at the same time—although the total number of ELJ-inferences has 
increased—the sum of the recursion parameters of inferences on the single branches 
of our deduction has not increased. Therefore we may successively eliminate all 
the ELl-inferences in our deduction which violate the condition expressed in this 
lemma. 

Now as before we may combine any EV- resp. ELl-inference with the E-d in- 
ferences leading to its y-premisses into a single new inference and we obtain the 
calculus LMs consisting of the usual axioms and the two rules 
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M,a >N ... M,ap> N 
M => b,N ... M >bg, N 
M,Oc >N ... M,Oc 5 N 
0 0 
EV MY > dı ... M? > d; 
M,v > N 


M,Uv,a1— N ... M,Qv,a,>N 
MOv>0,,N ... MOvu>b,,N 
M,Uv,Dc; > N ... M,Qv,Qc, > N 
M9,Dv > di ... M?, Ov > d, 
EO M,Ov > N 


which are to be read as before, except that in case p + q + s = 0, the number r 
has to be > 1. Now the EV-rule also comprises the E-d-rule of LM? thus we have 
shown: 


LEMMA 9 The calculi LMz and LM; are equivalent. 


Now the rule EV isn't invertible any more, but it still holds that if its conclusion 
is deducible, then so are its œ- and 8-premisses. This shows: 


LEMMA 10 Every LM3-deduction of a sequent M , v, v = N may be transformed 
into a deduction of the sequent M, v => N of smaller or equal length. 


Proof. This is true of axioms, and if M,v,v = N is the conclusion of an infer- 
ence J with principal formula different from v, then the premisses of this inference 
contain either both occurrences of v or none and to the former ones the induction 
hypothesis applies. Thus by an application of J to these transformed premisses and 
to those which contain no occurrence of v the sequent M, v = N may be obtained. 
If M,v,v = N is the conclusion of an EV-inference with principal formula v, then 
the above mentioned restricted inversion principle for EV may be applied to the a- 
and (—premisses and then these premisses may be transformed according to the in- 
duction hypothesis, whereas the *-premisses do not contain v. Thus by applying 
EV to the transformed a- and 8-premisses and to the original -premisses we arrive 
at the required deduction of M, v => N. Finally if our sequent is the conclusion of 
an ELl-inference with principal formula v, then the induction hypothesis directly 
applies to all premisses and the required sequent may be derived by a similar ap- 
plication of ELI to the transformed premisses. a 


Now while for the rule EV all premisses have smaller length than the conclusion, 
this is not the case for EL). Therefore we need the following 


A CONTRACTION-FREE SEQUENT CALCULUS FOR S4 ll 


LEMMA 11 a) Every LM3-deduction ofa sequent M, O(A, u] => N ora sequent 
M,O[A, Qu] 2 N may be transformed into a deduction of the sequent M, Du > 
N of smaller or equal length. 

b) A sequent M, Ofa, Du], Du = N is deducible by LM; iff the sequent M, Du => 
N is deducible. 


Proof. a) This is true for axioms, and it is trivially preserved under inferences with 
principal formulas different from O[A,u] resp. O[A, Ou]. For an EO- 
inference with principal formula Dfa, u] resp. O[A, Du] one premiss contains u 
resp. Ou and to this premiss the induction hypothesis applies and yields a sequent 
with two occurrences of u resp. Qu. Thus by an application of the preceding lemma 
we obtain deductions of the required sequents. 

b) If M,O[A, Du], Du = N is deducible, then (a) and Lemma 10 show that 
M,Qu => N is deducible too. If this latter sequent is deducible, then weakening 
shows that M, O[A, Du], Ou = N is deducible. H 


From this follows: 


LEMMA 12 The calculus LM, consisting of the usual axioms, the rule EV and the 
two ED-rules 


M,Qv,aq >N ... M,Qv,a, > N 
M,Ov > bı, N ... M,Qu>6,,N 


M, 00, a >N... M, 0v, ap > N 
MO > b, N ... MO => ba, N 
M,Qv,0c; >N ... M,ňOv, Oc, > N 
M? Ov > di ... M?, Ov > d, 
Ei——— Wem N o — 


where v is the formula |[aj,...,a5,—b1,... , "by, D, ..., Dc, "Uds, .. ., 
-2Ud;,], s = 0 for EDs, s > 0 for EOd and i is the formula |ai, ..., a5, bi, . .., 
ab,,Oc1,...,Oc,], deduces all the sequents which LM; deduces. (Henceforth we 
shall call formulas v with s = 0 shallow formulas and those with s > 0 deep for- 
mulas.) 


Proof. All we have to show, is that the EL]-rule of LM3 is admissible for this new 
calculus: so given all the premisses of an application of EO, Lemma 11 allows us 
to transform all its G-premisses into a form suitable for premisses of our new ED- 
rules. Moreover if the principal formula v of the given application of ED is deep, 
then the same lemma allows us to transform this formula into ? in all o-premisses. 
Thus we may deduce the conclusion of any application of EO from its premisses 
by applying either the rule EDs or EOd. E 
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This shows one half of the 
LEMMA 13 The calculi LM3 and LM, are equivalent. 


Proof. To prove the other direction we have to show that both ED-rules of LM, are 
admissible for LM3: for EDs this follows directly from Lemma 11. For EOd we 
rely on the equivalence of LM3 and LMo: suppose we are given all the premisses of 
an application of Edd, then from the y-premisses M9, Ov => d;, we may deduce in 
LM o the sequents M, Ov, ^L1d; = N, and from the B-premises we may as before 
deduce the sequents M, Ov, Ock => N. Thus by suitable applications of the rules 
E- and EV from all premisses we may deduce the sequent 


M,..., V2 (2D V ~a;) V ... V 2(2Db V 5b) V ... 
V-(-0v V 2e) V ...... V 23(2Uv V 2-Udj) V ... > N. 


But we may also deduce 


M,Uv > ...V 2(2D V 22i) V ... V 2-)2CI$ V 5b) V ... 
... V 2(4Ulv V 2e) V ... V 2(2UIv V ^-Uldi) V ..., N. 


Thus by an application of the cut rule we may deduce the required conclusion 
M,Ov => N of Ed. E 


Now for LM, the B-premisses of both EO-rules and the a-premisses of EOd 
are shorter than the conclusion, whereas for the -y-premisses of both rules and the 
a-premisses of ECs this does not hold. We deal with the latter premisses first. 


LEMMA 14 There is a transformation which converts every LM4-deduction of a 
given sequent into another LM4-deduction of the same sequent in which no a- 
premiss of an application of EUs is the conclusion of EQd. 


Proof. Given an arbitrary LM4-deduction w.l.o.g. we consider a maximal a- 
premiss s = M,QOv,a => N of an application of EDs, which is the conclusion 
of an application of ELld (here v = [a,a1,...,@)701,...,6,,Oc1,...,Oc,]), and 
we use recursion on the maximal number of successive applications of ECld pre- 
ceding s: this number cannot be 0, and thus we let w = [e1,...,e2, ^ fi, ..., ^ fu, 
L1g1, ..., Lgs, "Dh, . .. , "DI hu] be the principal formula of the application J of 
EOd leading to s, we let M be L, Ow and we let L, Ow,Ov => N be the con- 
clusion of J, the sequents s and L, Ow, Ov, a; > N and L, Ow, Ov => bj, N its 
a-premisses, L, Ow, Ov, Ock = N its G-premisses: then the premisses of J are 
of the form L, Ow, Dv,a, e; => N resp. L,Ow,Ov,a > f;,N resp. L, Dw, 
Ov,a,Og, => N resp. L°, Ow, Ov, > hy and the inference J is shifted down 
past J as follows: 

We use the given sequent L,Ow,QOv,a,e; = N and the sequents L, Ow, 
Ov,a;,er => N and L,Ow,Ov,ex = bj, N and L, Ow, Ov, Ock, e; > N 
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(where these latter sequents are obtained from the corresponding J-premisses by 
weakening) as premisses of an application of EOs leading to L,Ow, Dv, e; 
=> N. Similarly we obtain deductions of the sequents L, Ow, Ov = f, N resp. 
L,Qw,Ov,Og,: = N and from these and L°, Ow, Ov, = hy using an EDd- 
inference with principal formula Ow we arrive at a deduction of L,Ow,Ov => 
N where the recursion parameter has decreased by 1. Now the number of EDs- 
inferences may have increased in this new deduction, but the maximal sum of re- 
cursion parameters on any branch of our deduction cannot have increased. Hence 
we may in this way eliminate all the o-premisses of EDs which are conclusions of 
applications of EOd. Hu 


This lemma is applied for proving completeness of the calculus LMs which in 
addition to the modal operator O uses a new operator (©) to be substituted for O in 
certain situations: LM; has besides the usual axioms the rules EV, EC]s and EOd 
in the following form: 


M?,a,>N ... May > N 
M?, >b, N ... M? => bp, N 
M?,0c,>N ... M?, 0ce >N 
EV M? > dı ... M? > d; 
| Movs N o 


M!,0v,a, >N ... M?,Ov,ap > N 
M1,0v >b, N ... M!,Ov > b, N 
M?,0q > N ... M?, Oc, > N 


EOs——————— Mos N o 


M?,00,a >N ... M°, O0, ap > N 
M?, O0 > by, N ... M?,00 > ba, N 
M?, Oc >N ... M?, Oc > N 


where M! results from M by replacing any formula Ov with deep v by (Qv, M? re- 
sults from M by replacing any formula Ov by Ov, and M? results from M by omit- 
ting all non modalized formulas, i.e. all formulas not of the form Ov or (Ov. There 
is no rule for introducing the operator C), thus any application of one of the rules of 
LM; becomes a valid application of the corresponding LM,-rule if we replace any 
Q in all premisses and in the conclusion by a D. This shows that if M, (Ov > N 
is deducible by LMs, then M, Qu => N is deducible by LM; and by LM,; and in 
general: if M = N is deducible by LMs, then M? => N is deducible by LM; and 
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by LM,—one half of the equivlanence of LM, and LMs. For the proof of the other 
direction we call a formula Dv distant in a deduction d of a sequent m, Ov > N 
iff below any conclusion of an inference with principal formula Dv there is a con- 
clusion of an EV-inference or a 3-premiss. Then we show 


LEMMA 15 Any LM,-deduction d of a sequent M = N may be transofrmed into 
an LMs-deduction of the sequent M* = N, where M* results from M by replac- 
ing any number of formulas Ov distant in d by (Ov. 


Proof. Suppose we are given an LM4-deduction of a sequent s. We may assume 
that it has the property expressed by Lemma 14. If the final inference of this deduc- 
tion is an application of EV, then by the induction hypothesis the deductions of all 
premisses may be transformed into LM;-deductions of the same sequents and these 
are the premisses of an EV-inference of LM;-inference leading to the required se- 
quent. If the final inference is an application of EOd, then by the induction hypoth- 
esis the a- and -premisses are deducible by LMs and these are the corresponding 
premisses of an ELId-inference of LM; leading to the required sequent. But the y- 
premisses necessary for this inference are obtained from the given *y-premisses by 
replacing sufficiently many distant formulas Ov by Qu, because all the distant for- 
mulas of s are distant in all y-premisses, too. Therefore the required sequent is de- 
ducible by LMs. If the final inference J is an application of EDs with principle for- 
mula Ov, where v = [a1,...,@p,7b1,..., nbg, De, .. . , Oc,] and the formula Ov 
is not distant in an a-premiss preceding P and such that one of these a-premisses 
is the conclusion of another inference with principle formula Ov. In this case we 
may drop the inference J. Otherwise the formula Ov is distant in all a-premisses 
of J, and also all the distant formulas of our sequent M, Ov => N in its deduction d 
are distant formulas in all a-premisses too. Now again by the property of Lemma 
14 in the a-premisses at least all formulas Ow with w deep are distant. Thus by 
the induction hypothesis all sequents M^, Ov, a; = N resp. M^, Ov => b;, N are 
deducible, and they are the a-premisses of an LM;-application of ELIs leading to 
M+, Ov = N. Moreover as before the transformed deductions of the 3-premisses 
again yield the G—premisses of a corresponding LM;-inference leading to the re- 
quired conclusion M, Ov = N. Thus this latter sequent is deducible by LMs. BM 


The calculus LM; shows one single obstacle to contraction freeness, viz. the 
presence of the principal formula of an EDd-inference in the y-premisses. This 
obstacle is removed by considering the calculus LM having the usual axioms, the 
rules EV and EDs of LM; and the rule ECld in the form 


M?,U)5,a41 > N ... M?, Oð, ap > N 

M*,Do => b, N ... M2, Oò > b, N 
M? Oc >N ... M?,Qc,>N 
M3,0v > dı ... M3, Ov > d, 

BH—— Muns N ~ 
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In order to proof completeness of this calculus we call a formula Ov distant in 
a deduction d of a sequent M, Ov = N iff v is deep and below any conclusion of 
an inference with principal formula Ov there is a conclusion of an EV-inference or 
an a—premiss or a -premiss. Then we reproof Lemma 15 as: 


LEMMA 16 Any LM;-deduction d of a sequent M = N may be transformed into 
an LM-deduction of the sequent M^ => N, where M* results from M by replacing 
any number of formulas Ov distant in d by (Ov. 


Proof. If the last rule applied in a given LMs-deduction of our sequent s = M > 
N is EV, then all premisses are deducible by LM by the induction hypothesis, and 
moreover they are the LM-premisses of an inference leading to M4 => N. There- 
fore this sequent is deducible by LM, too. If the last rule applied is EDs, then the in- 
duction hypothesis gives us LM-deductions of all 9-premisses, and these are the re- 
quired LM-premisses for an inference leading to M^ > N. But in the a-premisses 
all deep formulas are marked by a ©. Thus we may also use the a-premisses of our 
LMg-inference as a-premisses of an LM-inference leading to M^ — N. Finally if 
the last inference is by an application of ELld with principal formula Ov and there is 
a y-premiss P of this inference in which Dv is not distant, then there is a chain of y- 
premisses preceding P which contains another occurrence of P. In this case the last 
inference may be dropped. Otherwise the formula Ov is distant in the deductions 
of all 4-premisses, and furthermore all formulas distant in our given deduction of s 
are distant in all -premisses, too. Therefore by the induction hypothesis we obtain 
all the ^-premisses necessary for an LM-application of EDdd leading to M^ > N. 
But the a- and 8-premisses for such an inference are obtained as before, hence in 
all cases we arrive at an LM-deduction of the required sequent M^ > N. E 


The calculus LM now has the desired property that there is a measure u, such that 
in every one of its rules the measure of the conclusion is greater than the measures 
of all premisses: namely we may take u(s) for a sequent s to be (the total number 
of connectives of s times (the number of Li's plus the number of ()'s)) minus the 
number of ()’s, and moreover there holds the 


THEOREM 17 A sequent is valid in S4 if and only if it is deducible by LM. 
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TRANSFER OF SEQUENT CALCULUS 
STRATEGIES TO RESOLUTION FOR S4 


1 INTRODUCTION 


This paper presents a general scheme [10, 11] of transforming a cutfree Gentzen- 
type system into a resolution type system, preserving the structure of derivations. 
This is a direct extension of the method introduced by Maslov [7] for classical pred- 
icate logic. In this way completeness of strategies 1s first established for the Gentzen- 
type system, and then transferred to resolution. The method 1s illustrated here for 
the propositional S4 in a way very similar to [13]. An adaptation of the Maslov’s 
method to S4 and some of the more elaborate strategies mentioned below are pre- 
sented in [17]. Let us recapitulate some material from [10, 11, 12]. 

The main idea of Maslov’s method can be summarized as follows: 

A resolution derivation of the goal clause g from a list I' of input clauses can be 
obtained as the result of deleting I' from the Gentzen-type cutfree derivation of the 
sequent I > g. 

Recall [10] what we mean by a resolution method for a formal system C. Such 
a method is determined by specifying: 


1. A class of formulas called clauses. 


2. A method of reduction of any formula F of the system C to a finite list I p 
of clauses. 


3. An inference rule E called the resolution rule for deriving clauses. 


4. The derivation process by forward chaining so that all derivable objects are 
consequences of initial clauses, and garbage removal from the search space 
is possible. 


The resolution method is sound and complete iff for any formula F', the deriv- 
ability of F in C is equivalent to derivability of the goal clause g from I p using 
the rule R. Reduction of an arbitrary formula F to the form Tp 2 g where T is 
a finite set of clauses and g is a goal (variable or Ø) is based on the familiar depth- 
reducing transformation by introduction of new variables. 


17 


H. Wansing (ed.), Proof Theory of Modal Logic, 17-31. 
© 1996 Kluwer Academic Publishers. 


18 GRIGORI MINTS, VLADIMIR OREVKOV AND TANEL TAMMET 


2 GENTZEN-TYPE FORMULATIONS OF S4 


Consider a Gentzen-type formulation of the predicate calculus S4 from [11] suit- 
able for pruning superfluous formulas (cf. below). Its derivable objects are sequents 
D => A, where I’, A are finite (possibly empty) lists of formulas of the language 
considered. The order of formulas in I’, A will always be disregarded, hence L', A 
etc. are treated as multisets (number of occurrences of formulas is important). 


Gentzen-type system GS4 


Axioms: A => A. 
Inference rules: 


Aı,Tı > Ai;..-;An Ta => An 


AV--VAGTSA (v=) 
where, U...UT, =T, A1 U...U An =A. 
I > A, Aj,,...,Ai,, (>v) r> A^ (weakening) 
SAAV. VA, I,D-AQA e I 


where 21 <... « iQ, € n. 


(=> ) ATS A TSA | =>) 
“PSA TA SATSA 


Modal rules: 


A, (0AP, T> A Or s A 


O [])-— ————— 
(>) CHATSA (= Dn 3 GA 


where superscript 0 means possible absence of the formula. So in fact we have two 
versions of the rule (D =): 


A,D2A ADAT >A 
OA, T> A OA,T>A 


Let us fix terminology concerning Gentzen-type systems. In the sequent T > A 
the left-hand side I and right-hand side A are sometimes called antecedent and 
succedent. In each inference rule the sequent written under the line is the conclu- 
sion, and the sequents over the line are premises. The formula shown explicitly in 
the conclusion, for example A; V --- V A, in (V =>) or I", A’ in (weakening), 
is the main formula, the formulas shown explicitly in the premises, for example 
A1,..., Án in the rule (V =), are side formulas, and the remaining formulas, for 
example T'i, Aj,...,0,,An,P, A in (V>) orT, A in (weakening), are paramet- 
ric formulas. 
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Pruned derivation is one containing weakenings only immediately preceding the 
last sequent of the derivation (cf. [5]). A derivation is p-inverted if propositional 
connectives are immediately analyzed. More precisely, if a sequent 5 is not an ax- 
iom and contains a formula of the form A V B, 4A, then one of these formulas is 
the main formula of the inference L introducing S. Moreover, if some of the side 
formulas of the inference L is again of this form, one of them is the main formula 
of the inference immediately preceding L. 

Recall the definition of the sign of a subformula occurrence in a formula: posi- 
tive subformulas occur within the scope of even number (for example zero) of oc- 
currences of negations and the premises of implication. Non-positive occurrences 
are negative. The proof of part (a) of the following statement can be found in [3], 
the proof of (b) is standard, (c) and (d) are well-known. 


THEOREM 1 
(a) Formula F is derivable in S4 iff the sequent => F is derivable in GS4. 
(b) Any provable sequent has a p-inverted pruned derivation. 


(c) The derivation of a sequent S uses only rules for connectives occurring in 
S, or more precisely, the succedent and antecedent rules corresponding to 
positive or negative occurrences of connectives. 


(d) A list I' of formulas is inconsistent in S4 iff the sequent T = (with empty 
right-hand side) is derivable in GS4. 


2.1 Bottom-up Proof Search 


Cut-free systems like GS4 suggest proof search from goal to subgoals. Initially one 
has goal formula (or sequent). Each subsequent step replaces some of the current 
goals by subgoals from which it can be derived until all leaves are axioms. Sub- 
goals are written on the top of the goal, so the whole procedure is working bottom 
up. 


EXAMPLE 2 Here are stages of proof search in GS4 where weakening is attempt- 
ed only for testing axioms. 


Goal Step1 Step2 


Oa > O(a = veeb) 
=> La, O(a V b) 


=> -Da V O(avb) —————————  --Da,0(a v b) 
> -Dav B(a v b) ————————— 
= -0a V O(a V b) 
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Step3 Step4 Stepó 
a=>A 
[la > a 
[la 2 aV b ———— [la > a 
—————— Oa>avb ——— 
Oa > O(a V b) —————— OaToa V b 
——— Oa > O(a V b) ——— 
= =a, O(a V b) ——————— Oa > O(a V b) 


= —Lla, O(a V b) 


————————— cz» ^D, O(a V b) 
= -Da V O(a V b) 


= Aa V O(a V b) 


=> -Da V O(a v b) 


3 INVERSE METHOD 


3.1 System S4p 


Instead of the bottom-up proof search described in the previous section, we con- 
sider here, following Maslov [6], top-down proof search by direct chaining. 

We begin with axioms, derive everything possible by one inference (applica- 
tion of an inference rule), then derive everything possible by two inferences etc. 
until the goal formula appears. Thus the proof search proceeds in the direction 
opposed to bottom-up search described in the previous section, which was used 
in most advanced programs during the first period of automated deduction. That 
is why Maslov used the term inverse method. Both the language and rules of the 
present formulation are determined by the goal formula F'. Introduce new distinct 
propositional variables called labels |, for all subformulas A of F which are not 
literals (propositional variables and their negations). For literals L put lg = L. 
Derivable objects are sequents T = A where T, A are lists of labels. 

Axioms and inference rules of the system 547 are obtained from the axioms and 
inference rules of G.S4 by replacing all formulas by their labels. 


EXAMPLE 3 Let F = -Ua V O(a V b). 
Here is a derivation of > lp in the system S4p . 
a>a 
lo. >a 
loa > lave 
loa = loaves) 
=> laoa, lo(avs) 


=> l5navü(avb) 


DEFINITION 4 Notation d : A4,...,Às => B means that d is a derivation of 
the sequent A41,..., An => B. 

If Ay,...,An,B are subformulas of F, and d : A,,...,An => B then lq de- 
notes the result of replacing all formulas (sequent members) in d by their labels. 
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EXAMPLE 5 Let d be the derivation in GS4 of the formula F from the Example 
2 , i.e. the result of the last step 5. Then l4 is a derivation from the Example 3. 


THEOREM 6 Ifd: Aj,...,An > Bi,...,Bm isa derivation in GS4, then it is 
isomorphic to la : l4,,...,l4, => lB,,...,lp,, asa labelled tree. 


"n 


Proof. Proof is obvious. i 
COROLLARY 7 GS4F F iff SAp. - lp 
Proof. Take d :— F and la :=> lr. a 


Our derivable objects are clauses up to replacing l1, ...,lk => Uk41,..-,ln by 
ali V... V aly V iggy... V ln. It will be seen that this view is fruitful. 


3.2 Strategies of Inverse Method 


First of all, the language can be restricted. The following signed subformula prop- 
erty is obvious. 


LEMMA 8 Ifd :=> F is a derivation in GS4, and sequent 
Aj,...,An > Bi,...Bm 
occurs in d, then A,,..., An are negative subformulas of F, and B,,... Bm are 


positive subformulas of F. 


From now on the language of the system S4r is restricted to sequents of the 
form E 


(1) l4,,...,lA, > lB,- lB 


where A1, ..., Án are negative subformulas of F’, and B1, ... Bm are positive sub- 
formulas of F . 

The previous Lemma shows that this restriction is still complete: if F' is deriv- 
able in S4, then lp is derivable in S4 Fr. 


EXAMPLE 9 Complete proof search for the formula F = Oa V O-a in S4r. 
In view of the restriction (1), only the axiom 
a>a 


is possible. No antecedent inference rule is applicable, since there are no compos- 
ite (non-atomic) negative subformulas of F. The rule — O is not applicable to a 
sequent containing more than one formula not beginning with O. Hence only one 
succedent rule (= —) is applicable to the axiom , resulting in 


=> 0,7 


The only rule possible now under restriction (1), is => O, but it is not applicable 
since there are two succedent formulas. This concludes the search: F' is not deriv- 
able. | 
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Inversion Strategy and Subsumption 


Recall that the propositional rules of the system GS4 (Section 2) are invertible if 
the rules for V are taken in their non-pruned versions : 


r> A,A, B AD2A B,U2A 
r> A,AVB AVB,T>A 


The conclusion of a propositional rule is derivable iff all premises are derivable. 
This motivates the following definition. 


DEFINITION 10 The label l4 is p-invertible if A has one of the forms BV C, 4D. 
p-inversion strategy for S4r is the following restriction: if a sequent is not an 
axiom and contains invertible label, then it is obtained by a propositional rule. 


Note a consequence of the p-inversion strategy for direct chaining: a non-p- 
invertible rule cannot be applied if the result contains a p-invertible label. For ex- 
ample, sequent lo4,lcvp => leva can be obtained (according to inversion strat- 
egy) by the rules V >, => V , but not by the rule O =. Hence this sequent cannot 
be generated from the sequent l4, loy p > leva. 


LEMMA 11 p-inversion strategy is complete. 


Completeness of the p-inversion strategy for GS4 is stated in Theorem 1(b), and 
is preserved by our transformation from GS4 to S4p. O 


Note that the rule > O 
Ul A 


DP s OA 
is also invertible. This motivates the following definition. 


DEFINITION 12 The label l4 is invertible in a sequent S if A has one of the forms 
BvC,3DorAis OB and S is loa, V...V loa, V lap. 
Inversion strategy for S4r is the following restriction: if a sequent is not an 


axiom and contains invertible label, then it is obtained by a rule introducing such 
a label 


LEMMA 13 Inversion strategy is complete. 
Completeness of the corresponding inversion strategy for GS4 is proved in a 
standard way: cf. Section 8B.1 in [3]. It is preserved by our transformation from 


GS4 to S4p. O 


DEFINITION 14 Sequent T' = A subsumes the sequent T" > A’ iff CI', AC 
A’. | 
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Note. It is also possible to extend this definition using implication OA => A, but 
this requires some care. 

Subsumption strategy allows one to delete from the search space any clause which 
is subsumed by the remaining clauses. It is complete for the same reason as the 
analogous strategies for resolution, but its compatibility with other strategies should 
be verified in each particular case. 


4 RESOLUTION 


We reproduce some material from [8, 9, 10] and connect it with Section 3. 


4.1 Modal clauses 


Recall from the section 3.1 that the label [A stands for a subformula A of the goal 
formula F. This can be expressed by postulating equivalence 


D(l4 + A) 
or two strict implications 
(2) UüU(l4- A) O(A — la) 


Most often only one of the implications (2) will be preserved: A — l4 for positive 
subformulas A, and |4 — A for negative A. Some of the implications (2) can be 
simplified, for example a V b — c can be replaced by two implications a — c, b 4 
c. Finally, implications a + D are replaced by clauses ^a V D. 

We define literals as atoms and their negations and denote them by 1,l,.... 
Modal literals are by definition expressions of the form }, Ol, Ol where © is treated 
as 3D-. Modal literals are denoted by L, M, N, Li, Mi, N,,.... Complements 
are defined by ~Ol = Ol, =O} = Od in a natural way. 

Propositional clauses are disjunctions of literals. Modal clauses (or simply 
clauses) are disjunctions of modal literals. Initial modal clauses are expressions 
of the form OC where C is a modal clause. As usual, clauses are treated as sets of 
literals, i.e. up to the order and number of occurrences of literals. Hence there is an 
implicit contraction rule. To simplify notation we require that C in an initial modal 
clause contain at least two terms. Reduction of arbitrary formula to a clause form 
is given by the standard depth-reducing transformation. 


THEOREM 15 Forany formula F one can construct (by introduction of new vari- 
ables) the list X p of initial clauses and a propositional variable g such that 


Fs4 F iff Fs &Xp-9 


Proof. This is Theorem 3.1 of [10]. a 
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4.2 Resolution Calculus RS4 


Derivable objects: modal clauses and initial modal clauses, the latter only as initial 
objects of derivations. We are interested in derivability relations X - C where X 
is a set of initial clauses, C is a modal clause. 


Axioms: LV -L for modal literals L which occur both positively and negatively 
in X F C. 
Inference rules: 


DVL Ev AL IVD iV OD DD 


(R) DVE oiv D C? üivop ©) pu) 


Here D, E denote modal clauses. OD stands for the result of prefixing © to all 
literals in a propositional clause D, while OD is the result of prefixing O to the 
whole of D. 


Notes. 


1. The rule (O7) can be restricted to apply only for bootstrapping initial clauses. 
Cf. Section 4.4. 


2. All rules are obviously valid for derivability from O-formulas in S4. 


3. Important derived inference is the clash rule [2] where the whole clause is 
resolved: 


LiWV...VL4 Dı V AL, e Dan V ^L 
Dı V... V Dn 


4. In our present formulation (but not in the formulation of the section 4.4 
below) it is possible to avoid postulating tautologies L V ^L as the axioms. 


5. Subformula property for R.S4: a literal occurs in a derivation of X F C only 
if it occurs in X, C up to ~, O. The latter can happen when rules (©), (0) 
are applied. 


The notation for derivability in RS4 is X Fr C 


EXAMPLE 16 Here is a derivation of O(-a V Ob), O(a V c) Fr Ob v Oc. 


D(-a V Ob) C(a V c) 


~a V Ob avc 
bve 


Ob V Oc 
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4.3 Maslov Transformation 


The translation GR into R.S4-derivations is defined here for pruned G S4 -deriva- 
tiond := F where F isa formula. The first step transforms d into a 54 p-derivation 
la := lp (cf. Definition 4 and Theorem 6 above). Note that any sequent in lg has 
the form 


(3) lA, sl A, => lp... lB, 


where B; are positive subformulas of F, and A; are negative subformulas. 
We obtain GR(d) by replacing sequents (3) by 


ala, V...V-l,, Vip, V... Vlgp 


m. 


and adding necessary input clauses to construct correct resolution inferences. 
Some contractions intervene since clauses are handled as sets, but they are easily 
dealt with. Details are in [10, 11]. Cf. also next section. 


The transformation RG from a derivation in RS4, d : X F g, into the derivation 
in GS4 can also be easily defined if d uses only GR(d)-rules from the Section 4.4 
below. 


THEOREM 17 (Soundness and Completeness Theorem) Let F' be a modal formula, 
and X p, g be as in Theorem 15. 

(a) If d :— F is a derivation in GS4, then GR(d) : Xp + g(or Xp - 0) isa 
derivation in RS4 , where Xp is a sublist of X p. 

(b) If d : Xr FĀ g (or Xp | 0) in RS4 then RG(d) : Xp => g is the derivation 
in GS4. 

(c) sa F iff Xr ^ nsa g. 


Proof. Cf. Theorem 2.4 in [11]. Ba 


EXAMPLE 18 Let F = 70a V O(a V b), and let d := F be the derivation from 
the Example 3. Then l4 is presented in Example 5, and G R(d) is the following 
derivation 


D(las V lana) 
O(lp V lana) — loa V l-oa 
O(lp V—loave)) lr V al-oa laoa V ln(avo) 
lr V loaves) lr V loave) 


d: aloa V lo(ave) 
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where d' : aloa V la(avo) is 


O(lave V Al.) ala V la 
lave V aq 3l, V la 


O(lo(avė) V 7Ol evs) aOl, V lave 
D(-as, V Ola) — loaves) V 7Olave 7Ul, V Olavo 
aloo V Ol, . Ul, M ln(avb) 


oloa V ln(avb) 


4.4 Strategies of Resolution. Transfer from Gentzen-type Systems 


We see that the structure of G R(d) is the same as the structure of d. This allows us 
to carry over strategies. Note that some strategies known in the classical case are 
obviously incomplete. One example is hyperresolution. A rich source of complete 
strategies is the transfer from Gentzen-type systems via the transformation GR and 
its analogs. Itallows us in particular to prove the completeness of a powerful analog 
of hyperresolution due to V. Orevkov [15]. Let us first describe in more detail the 
structure of the derivation G R(d) from the previous theorem. Derived objects are 
clauses 

(4) a- V... l4- Vig: V...lp+ 
where A; are negative subformulas of the goal formula F’, and B} are positive 
subformulas of F. In the following list we denote resolution-type rules by the same 
symbols as the rules of GS4 from which they are derived. 


Note. Only the following rules are actually used in G R(d). 
GR(d)-rules : 


D(la,vAa; V 2l4,) DV la; 
D V lAivA; 
O(-l4,va. Vla, Vla) DVv-la, D' Vola, 
alavaa V DV D' 
O(l.4Vla4) DVola, O(-li4 Vala) DVI, 
(= 7) DV ili, S >) D V A54 
o) ala VD ) la v-04 v...7A0l,, 
-0l VD Ola V-04 V...701, 
(a =>) O(-lo, V Ola) AOl, VD ( D) D(la4 V 2OlA) Ol, v D 
alo, V D loa VD 
Note that only some instances of the rule (R) are needed, and rule (O7 ) is subsumed 


into other rules. Note also that all rules except the rules (C), (OC) have initial 
clause as one of the premises. In any derivation of the form G R(d) the rule 30 is 


(= v) 


(V 2) 
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always combined with the rule (O =), and the rule O is always combined with the 
rule (=> O). One can replace these four rules by the two rules: 


DO(-lna V Ola) ala v D O(loa V 2QlA) la V Dn 


C C 
( >) alo, V D (> ) loa V Dao 


where Dg is aloa, V... nA, . We transfer to GR(d)-rules notation of main and 
side formulas from the sequent calculi. Main literal of each rule except (^O), (C1) 
is the literal l4 explicitly shown in the conclusion (maybe negated). For exam- 
ple, main literal of both rules > V, V = is Láva. Main literal of both rules 
(7Q), (Q) is Oly. Side literal of a rule is any literal l4 shown explicitly in the 
non-initial premise of the rule. More precisely, side literal of the rule > V is l4;, 
side literals of the rule V = are l4, , lA,, and side literal of the next four rules is / A, 
and side literal of the two remaining rule is Ol 4. Main literal of an axiom L V ^L 
is L. Main or side formula of an axiom or rule with main or side literal /4 is A. 

Restriction to clauses of the form (4) and to the rules used in G R(d) already con- 
stitutes a strategy. S. Maslov called similar restriction for classical logic inverse 
method strategy of the resolution method. The reason for this is clear from the sec- 
tion 3. 

Atomic axioms 

Axioms can be restricted to 14 V -l, for atomic formulas A, since similar re- 
striction for the axioms A => A is complete for GS4. 

Inversion strategy for RS4 is the restriction: if a clause contains at least one 
literal +14 with A = ~B, C V D (non-atomic B), or is of the form loa V Dno then 
this clause is obtained by one of the rules > 5 >, > V >, > 0. 


LEMMA 19 Inversion strategy is complete for RS4 


Proof. As in Section 3.2, take a pruned inverted derivation in G.54, and apply Maslov 
Transformation G R. a 


Unique occurrence strategy This is a strategy combining several inferences 
into one block and allowing to delete intermediate inferences in the block from the 
search space. A general scheme of the strategy as well as its form for S4 was in- 
troduced by A. Voronkov in [17]. An analogous strategy for linear logic [4] was 
independently introduced in [16] and was one of the decisive factors of the success 
of Tammet's prover for linear logic. 

A positive literal /4 is dischargeable, if A has exactly one positive occurrence 
in the goal formula F, A Æ F and A does not occur as an argument of O. 

A negative literal ~l 4 is dischargeable, if A has exactly one negative occurrence 
in the goal formula F’, and either A occurs as an argument of O or A occurs as an 
argument of ^ and A 4 OA’. 

Unique Occurrence Strategy. If a clause C contains a dischargeable literal l4 or 
—l, then immediately apply a rule with l4 or -l, as a side literal to C and delete 
C itself from the search space. 
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THEOREM 20 Unique occurrence strategy is complete. 
Proof. The proof can be found in [17]. E 


Focusing strategy [15]. This strategy is known for linear logic [1], cf. also [16]. It 
was first discovered for bottom-up proof search in classical and intuitionistic logic 
in 1974 by V. Orevkov [15]. 


DEFINITION 21 Let d be a derivation of X p | lp according to G R(d)-rules and 
T be an inference (application of an inference rule) in d, S be one of the premises 
of T, and lg be the side literal of T in S. Inference T is (+) -focused on S, if S is 
an axiom or the following conditions are satisfied. 

(a) if B = ^C,C v D, or (B = OC and lg is not negated in S), then S is the 
conclusion of the inference with the main literal lg. 

(b)if B = OC, lg is negated in S and T is (D = ), then S is the conclusion of 
the inference with the main literal lg. 

(c) if B is atomic, and lg occurs in S unnegated, then S is an axiom. 

Derivation d is (+) -focused, if it is (--) -focused on all premises of all its infer- 
ences except possibly (=> 0). 


(—) -focusing is defined by changing sign in the clause (c) of the previous defini- 
tion: 
(c’) if B is atomic, and ~l g occurs in 5 , then S is an axiom. 
+-focusing strategy: generate only derivations +-focused on all subformulas. 
—focusing strategy is defined similarly. 
Note. Compare clause (a) above with the inversion strategy. 


THEOREM 22 The (+) -focusing is complete, as well as the (—) -focusing. 


Proof. We establish completeness of similar strategy for a modification of GS4 
and apply Maslov transformation. B 


Gentzen-type system G.54» is obtained from G S4 by restricting formulas in the 
axioms to be atomic, and replacing the rule = V by two rules: 
rT s A,A; Ai V...VAn r> A,A; 


TSAA,V...VA, >Y PSAAV...v4A, 


Definition of the (+) -focusing for G.S4*-derivations is very similar to one given 
above for RS4: 


DEFINITION 23 Let d be a derivation of a formula F in GS 4x, T be an inference 
in d, S be one of the premises of T, and lg be the side literal of T in S. Inference 
T is (+) -focused on S, if S is an axiom or the following conditions are satisfied. 
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(a) if B = 2C,C V D, or (B is a succedent formula of the form QC) , then S 
is the conclusion of the inference with the main formula B. 

(b)if L is (D =) and B = OC, then S is the conclusion of the inference with 
the main literal B. 

(c) if B is atomic succedent formula, then S is an axiom. 

Derivation d is (+) -focused, if it is (+) -focused on all premises of all its infer- 
ences except possibly (= Q). 


Completeness for G R-rules is reduced to the following proposition. 


THEOREM 24 Every derivation d in GS4+ can be transformed into a (+) -focus- 
ed derivation with the same endsequent. 


Proof. We can assume d to be pruned and all axioms atomic. Uppermost non- (+) 
-focused inferences are normalized by permuting them maximally upward into the 
place where they are already (+-) -focused. More precisely, we use induction on the 
size n(d) of non-focused part of given pruned derivation d, that is on the number n 
of inferences which are below any premise of a non-focused inference. Pick up one 
of the uppermost non- (+) -focused inferences. Assume (to save notation) that it is 
(= V)-inference with main formula AV B and side formula A. Since the derivation 
d is pruned, the side formula A is traceable up the derivation to main formulas of 
some inferences or axioms: 


A! => II’, A’ A" — II", A" 
^-ILA A= A 


/ 


| 
DEA ,. 


lD2—3X,AVvB 


where the part ending in l = X, A is (+) -focused. Permuting inference L upward 
to the places shown explicitly results in the following derivation : 


A s II', A! A" > II", A" AA 
A —ILA L A AVB 
A-—ILAVvB 


| / 
[D 2—X,AVB 


L 


where the part ending in I' > X, AV B is (+) -focused, and the induction parameter 
is decreased at least by 1. 
In the case of two-premise inference, permutation should be repeated. H 


To prove the completeness for resolution rule it remains to apply Maslov trans- 
formation. O 
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EXAMPLE 25 The following inference satisfies the clause (c) in the definition of 
+-focusing. 
O(a VipVv ^lavB) aa Va alg VD 
a V -lav B VD 


Recall from the section 3.1 that a = la for atomic a. 


4.5 Problems 


1. Find reasonable extension of Maslov transformation to propositional logics 
containing induction: dynamic logic, temporal logic of linear time etc. The 
difficulty : cut-free Gentzen-type derivations in these logics are graphs with 
cycles, not trees. Hence any syntactically admissible clause can be initial. 


2. Extend the present approach to Ohlbach translations of modal logics [14]. 
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HAROLD SCHELLINX 


A LINEAR APPROACH TO MODAL PROOF 
THEORY 


We show how to extend the proof theoretical analysis of sequent derivations in clas- 
sical logic by means of linear logic, as exposed in [2], to sequent derivations in S4, 
by constructing embeddings of S4 into classical linear logic that are correct with re- 
spect to proofs (i.e. define linear decorations of S4-derivations). Among the imme- 
diate corollaries are cut elimination for S4, and reduction strategies that are strongly 
normalizing. 


“dedicated to dj" 


1 AIMS 


Linear logic! bans the structural rules of weakening and contraction from the for- 
mulation of classical logic as a sequent calculus, and re-introduces them in modal- 
ized form: structural manipulation to the left of the entailment-sign is allowed only 
for formulas prefixed by “!”, to the right only for those prefixed by “?”. (‘Linear’ 
logicians refer to !, ? as the exponentials.) The resulting calculus is a proof theo- 
retical jewel, which combines the deep symmetries of classical, with the computa- 
tional properties (strong normalization, confluence) of intuitionistic logic, without 
loss of expressive power. 

If one forgets the linear ‘type’ of the connectives appearing in a derivation in lin- 
ear logic (i.e. one replaces (&, &) by A, {@, p} by V, and — by —), and erases all 
exponentials, then (modulo possible repetitions of sequents) the result (7's skele- 
ton, sk(7)) is a correct derivation in sequent calculus for classical logic. Conversely 
each derivation in intuitionistic or classical sequent calculus occurs as the skele- 
ton of a derivation in linear logic: there exist modal translations (-)? of formulas 
into linear formulas, obtained by replacing each connective by one of its linear ana- 
logues and prefixing each subformula by a modality (i.e. a (possibly empty) string 


1(3]. A good introduction is [8]. 
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of exponentials), whose inductive application to a sequent derivation 7 in classical 
logic results in a correct derivation 7°, satisfying sk(7?) = 7, in linear logic.” 


decoration 


skeleton 


Thus we are able to study properties of derivations in intuitionistic or classical 
sequent calculus via their linear decorations. For example, cut elimination for clas- 
sical and intuitionistic (second order) logic becomes an immediate corollary to cut 
elimination for linear (second order) logic, as reductions of the linear decoration of 
a proof 7 become, via the method of ‘reflection’ or ‘pull-back’, reductions of the 
original: one decorates the original, reduces the decoration in linear logic, and takes 
the skeleton of the result (cf. the diagram above). This procedure defines reductions 
of classical derivations that are strongly normalizing and confluent. (PROOF: The 
reductions of the decoration have these properties.) 

Our method moreover enables us to identify possible obstacles to a decent com- 
putational interpretation of cut elimination in classical sequent calculus, and to for- 
mulate solutions as suggested by linear logic. For all details, and much more, we 
refer the reader to [2]. 


The purpose of this note is to point out that this ‘linear analysis’ of derivations 
and their normalization is not limited to sequent calculus for classical logic, but 
applies, virtually unchanged, to the (standard) sequent calculus formulation of the 
modal logic S4 as well. 


Though classical linear logic is essentially different from S4, there are also pro- 
found resemblances between the systems. And it probably is largely due to linear 
logic’s rather unconventional notational apparatus, that many still appear puzzled 
when confronted with the fact that, in some sense, linear logic might be deemed a 
fragment of S4. Consider the introduction rules for the exponentials (the 'derelic- 
tion’ and the ‘contextual’ rules) in table 1. When reading O for ‘!’ and Q for “?’ one 
finds precisely the modal introduction rules one needs to add to classical sequent 
calculus in order to obtain a sequent calculus formulation of $4: the only formal 
difference between the implicational fragment of S4 and the calculus of table 1 is 


?]n fact, as shown in [5], there are essentially two such translations. 
We refer to them as t and q. 
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Identity axiom and cut rule: 


r> A,A A,I’ = A’ 


Ax A>A cut DSA, A 
Logical rules: 
Lo [244A BT >A’ R> r,A>B,A 
L,I", A — B > A,A r>A-~-B,A 


Exponential structural rules: 


W! r> A W? DA cib A IA => A co Lt, ?A, A 


riAsA r>?A,A ' TIASA ' T=>?A,A 


Exponential contextual rules: 


L? TAS?A R! T=>A,?A 
“IP, ?A>?A ‘IT => !1A,?A 
Exponential dereliction rules: 
go l0 44 pL bA-A 
r>?A,A D,!'A2 A 


Table 1. The implicational fragment of classical linear logic 


that in the latter the use of structural rules is restricted, at the left to formulas pre- 
fixed by ‘!’ (LI), at the right to formulas prefixed by '?' (0). So taking the skeleton 
of a derivation 7 in linear logic, as described above, in fact is a compound opera- 
tion: forgetting the linear ‘typing’ of the logical connectives takes us to S4; erasing 
the exponentials then leads from S4 to classical logic. 

Hence every derivation in linear logic has, let’s call it a modal skeleton, which 
is a sequent calculus derivation in S4. 


skeleton 


But how about the converse? Does every S4-derivation occur as the modal skele- 
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ton of a proof in linear logic?? If we take ‘modal skeleton’ in the above, strict sense, 
this clearly can not be the case. A derivation like that of p, Op = p consisting in 
an (atomic) instance of the identity-axiom followed by a weakening on Op could 
never be realized in linear logic: p,?p = p simply is not derivable. However, in 
a slightly more devious sense, in which ‘modal skeleton’ will come to mean “for- 
get some exponentials, but keep others”, the answer to the question turns out to be 
affirmative. 


2 MEANS 


Of course! If we want to interpret S4-proofs in linear logic, we have to be able 
to distinguish the S4-modalities (which, as usual, are about ‘what might and what 
must be the case’) from the linear exponentials, which keep track of the manage- 
ment of the structural rules within the proof under consideration, and, dually, in- 
form us on its future behaviour under cut elimination. (By the way, as we are talking 
proof theory, cut-free (normal, direct) proofs will not interest us very much; only 
‘how we get there’* by means of operating on ‘real’ proofs: yes, yes, cuts indeed, 
lots of cuts!) 

How to distinguish between modalities serving a different purpose, but obeying 
the same (introduction) laws? Well, recall that these (contrary to those for the logi- 
cal connectives) do not determine the modalities uniquely (modulo provable equiv- 
alence). So, in linear logic (as in S4) we can add a second, a third, fourth, etc., pair 
of modalities to the language, plus, for each, the usual left/right introduction rules 
to the calculus. These new pairs of modalities will be independent of the original 
one (and of each other). This often is considered a vice, but for our purposes it is 
a virtue. (It will however turn out to be convenient for the fulfilment of our task to 
nevertheless stipulate some kind of relation between these different exponentials.) 

We introduced ‘linear logics’ using (ordered) sets of distinct (indexed or 
‘coloured’ ) exponentials in [1]. Let us recall their definition: given some preorder 
R = (R, x), the language of the multicolour linear logic R-LL consists in the non- 
modal part of the language of usual linear logic, extended with pairs of indexed 
exponentials !, ?, one for each element of R. To the usual non-modal rules of lin- 
ear logic we add 


1. for each pair of indexed exponentials the usual dereliction rules R? and L!; 


2. structural rules to the left for formulas prefixed by !, structural rules to the 
right for those prefixed by ?, eventually restricted to indices from subsets W 


31t is important to observe that this is more than merely asking for an embedding of S4 into linear 
logic, correct with respect to provability. That in a such sense S4 may be embedded in linear logic is 
shown in [4], by means of a simple extension of the Grishin-Ono translation of classical into linear logic 
(which can be looked up e.g. in [8]). 

4Us ‘linear folks’ call this: dynamics. 
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(allowing weakening) and C (allowing contraction) of R, that are upwardly 
closed with respect to < (i.e. if a € W(C) and a < b than also b € W(C)); 


3. rules for the introduction of indexed exclamation marks on the right, namely 


1G... Di,..., Dv 


AG, Sl Gs => lA, Di. Dm 


y 1 d yn 


! 9 
R! ay Gn = Ay 


) 


provided that z < z;,z < yj for all z;, yj, and, under the same condition, a 
rule for the introduction of indexed question marks on the left: 


AG sul Gn, A S Di... Ds 


y1 m 
au Gn 2A => Di... Dm 

Itis easy to verify that all LL-properties (cut elimination, strong normalization, 
etcetera) are inherited by R-LL. (PROOF: Note that if we forget about the indices, 
a derivation in R-LL is just a derivation in LL; we therefore eliminate cuts as if we 
were in LL, which is correct from an R-LL point of view due to transitivity of < 
and the closure conditions on W and C.) 

As a matter of fact, for each preorder R and choice of W and C, the set of deriva- 
tions in the calculus R-LL corresponds to a certain (not necessarily proper) subset 
of the collection of all LL-derivations, a subset satisfying certain restrictions on the 
use of exponential rules and which is closed under cut. 

In order to interpret S4 we are going to use a particularly simple preorder, 
namely the usual <-relation on the set (0, 1}. So we have two pairs of exponentials: 


I ? and !, ? (the former can be introduced in a context containing the latter, but the 
converse is forbidden). Moreover (but this is not essential) it turns out that we can 


take W = C = (1). The pair |, ? will play the role of the S4-modalities O, Q; the 
! ? 


pair ;, ; is the usual LL-pair !, ?. (And, of course, we can and will just write 0, © 


for i, 2, and !, ? for L ?) Let us call the corresponding (bi-coloured) linear logic: 
2-LL. We will refer to D, © as the 0-indexed exponentials in a 2-LL-derivation, to 


!;? as the /-indexed exponentials. 


Our second ingredient are the (dual) translations t and q of classical into linear 
logic that we mentioned above, translations having the special property of giving 
rise to decorations of derivations in sequent calculus: they extend to translations 
of proofs. 

The mappings t and q correspond on the nose to two, again dual, ways to treat 
the 'structural phase' (neither of the cut formulas has been introduced in the last 
inference before the cut) when eliminating cuts from a classical proof: the t-transla- 
tion corresponds to first permuting the cut upwards in the subderivation of the right 
premiss, while q corresponds to first permuting the cut upwards in the subderivation 
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of the left premiss (cf. [5], [2]). Using the translations uniformly corresponds to 
persistently applying either the right- or the left-protocol. . 

Another option is to first apply a marking to a given derivation 7: call occur- 
rences of (sub)formulas in 7 eithert or q, the only limitation being that this marking 
should respect identity-classes (i.e. corresponding occurrences of formulas in pre- 
miss and conclusion of a rule get the same marking, corresponding occurrences of 
subformulas in the two formulas of an axiom or in the two cut formulas are marked 
identically, corresponding occurrences of subformulas in contracted formulas are 
given the same sign, etcetera?). We will note the conclusion of a marked deriva- 
tion z as I1, 5 = At, A3, where I} denotes the multiset of all formulas in the 
antecedent of the conclusion that are marked t, etcetera. One then defines a transla- 
tion of marked formulas, which once more, when applied inductively to a (marked) 
derivation results in a decoration of 7. (Of course the uniform translations are the 
two limit-cases (‘all q, all t’) of the ‘mixed’ translation, which, obviously, corre- 
sponds to a cut-elimination that ‘mixes’ protocols (i.e. left or right depends on the 
marking of the cut-formula). 

The LL-decorations of classical derivations thus obtained are decorations in 2- 
LL as well: an LL-proof is nothing but a proof in 2-LL that does not use 0-indexed 
exponentials. 

Hence the idea will be clear. We are going to extend our translations to S4, by 
mapping the S4-modalities to 0-indexed exponentials in 2-LL. The principle, all 
tricks and traps will already be sufficiently illustrated by considering the implica- 
tion-only fragment of S4. (By herself or with a little help from [2] the reader easily 
finds the extension to the full system.) 


The translation is defined as follows: for atomic formulas p, whether they are 
marked t or q, it is the identity, i.e. p; for compound formulas one translates ac- 
cording to the following table: 


aB +» ja Ialta 
+ || 74—28 | Ane || oa | ora | oa | 
a | tare | Aone || oa || ora | ona 


Given a marked formula ¢, we will denote by ¢* a 2-LL-translation of ¢ 
according to this table. So ¢* will depend on the markings of ó's (proper) 
subformulas. To give an example, if ọ = (O((L1A)3 — Bt)9)* then $* is 

Here is our first proposition. 


3 A formal definition can be found in [1] and [5] 


A LINEAR APPROACH TO MODAL PROOF THEORY 39 


PROPOSITION 1 Suppose that v is a marked S4-derivation with conclusion T} , 
I3 = At,A3. Then we can construct a 2-LL-derivation n* of ?T1,!T5 => 
? A1,?1A5, which moreover has the property that, by deleting all 1-indexed expo- 
nentials, one recovers (modulo possible repetitions of sequents) the original proof 
T. 


Proof. Of course our claim is shown to hold by induction on the length of marked 
S4-derivations. 

- It is easily verified for axioms, and if our last inference was one of the struc- 
tural rules, then note that by induction hypothesis the contracted, resp. weakened, 
formula(s) are/is prefixed by the ‘right’ exponential, and thus can be contracted/ 
weakened in 2-LL. 

- Suppose the last inference was a cut between T}, r3 = At, A3, A3 and A3,T'5, 
I4 => A$, Aj, then, by induction hypothesis, we have 2-LL-derivations of ... > 

.., !A* and !A*,...=> .... We proceed by applying an exponential contextual 
rule to this latter derivation, ‘promoting’ !A* to ?!A*; and we are done by a cut 
in 2-LL. (As for the structural rules, also the application of exponential contextual 
rules is always possible: all formulas are prefixed by the ‘right’ exponentials.) 

In case of a cut between T}, C3 = Aj, A3, At and At, T$, T] = As, Ad we 
have, by induction hypothesis, 2-LL-derivations of ... 2 ...,?A* and !?A*,... 
=> .... Now we proceed by applying an exponential contextual rule to the former 
derivation, promoting ? A* to !?.A*.° 

- In case our last inference is a left/right implication introduction rule we reason 
similarly. We leave the details to the reader. It comes to the fore in case antecedent, 
marked 


- What is left to verify is the case of the introduction of the modalities. Let us 
start by considering the ‘dereliction’-rules, e.g. LO. 

Suppose we got T}, T2, (ODA) > At, A3 from T$}, 12, At > At, Ad. By in- 
duction hypothesis we have a 2-LL-derivation of !?T7,!05,!?A* > ?AT,71A5. 
We continue as in: 


Ds IDs, I2A* => 2As, AS 
rs, IPs, CI? A*  2A2,?1A7 
UTT,IDS,!DI? A*  ?A$,?!A5 


We leave other markings of the active formula in case of LL] and all instances of 
the dual rule RO to the reader. So what remains are the contextual rules, e.g. LO. 


61f the cut is on a formula marked q, then, in ‘linear terminology’, in the decoration the derivation 
of the right premiss is a box, if it is on a formula marked t, the derivation of the left premiss is a box. 
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Let us suppose we derived (LIT'1)*, (OT2)9, (9A)! = (041)5, (0^2) from 
(OV; )*, (OT2)9, Aa => (OA, Jt, (9A2)9. We find: 


1(Or1)*, {Or2)*,!A* 2 ?(9A1)*,?(042)" 

I (ET3)*, (EIP2)*,?!A4* 2 ?(0A1)*, * (0A2)* 
* (ET1)*, (Ol 2)*, 0?1A* => ?(0A1)*, ? (0A2)* 
'(EIT4 )*, (DT2)*,?0?1A4* => ?(0A1)*,?((0A3)* 


Note that it is here (as in the other instances of LỌ and those of RO, all left to 
the reader) that we (and essentially so!) benefit from the special form of the 2-LL 
exponential contextual rules, which allows ‘promotion’ to an exponential having 
an index below the indices of the contextual exponentials: we may introduce 0/0 
in contexts where all formulas are prefixed by !/?. a 


So when inductively applying our translations, an S4-derivation 7 becomes 
‘wrapped up’ in thick layers of linear exponentials. (Many of these in fact will be 
‘superfluous’. They can be identified and removed using the techniques developed 
in [1]. This will lead to optimal (1.e. using a minimal number of exponentials) linear 
interpretations of out modal proof. But for the moment that's beside the point.) 


REMARK 2 Whereas for classical logic the converse of Proposition 1 trivially 
holds, this is no longer true in the present case: note e.g. that !À => ?!L]?!A (the 
translation of A9 = (L143)3) is derivable in 2-LL. 

It is not unlikely that a more complex 'multicolour' embedding can be construc- 
ted that does have both properties. Let us, however, for the moment stick to the 
'simple' one. 


DEFINITION 3 A marking of an S4-derivation that assigns t to all occurrences of 
(sub)formulas of the form QA, and q to all occurrences of the form OA, is called 
a sound marking. . 


And now for the upshot. 


THEOREM 4 Let n be an S4-derivation. If we reduce in linear logic a decoration 
7* corresponding to a sound marking of n, and delete all 1-indexed exponentials 
from the reduct, then (modulo possible repetitions of sequents) the result is an S4- 
derivation v! which is a reduct of x (in other words, the diagram of page 34 applies 
to S4). 
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Proof. Due to the fact that all formulas QA are marked t and all formulas DA are 
marked q, the reductions of the decoration can not induce permutations that are 
impossible in S4. H 


REMARK 5 The property expressed by Theorem 4 does, in general, not hold for 
decorations n* obtained from markings of an S4-derivation that are not ‘sound’. 


It will be most instructive to show why. (This moreover should clarify the proof of 
the theorem.) 


Consider the following possible occurrence of a cut in an S4-proof: 


C,Ur; > 9A, 941 T2,0A => A2 
OC,LUT; > 04,041 T2,0A=> Ao, B 
QC,UT1,D2 > 041, A2, B 


(The exact nature of the last inference in the right premiss in fact is irrelevant. 
What counts is that QA is not main formula of the rule.) 


Suppose we mark QA by q, and all other occurring formulas as well (this how- 
ever, as the reader will see, is of no consequence for the argument). Then in a cor- 
responding decoration n* the instance of cut becomes 


1C*, (OT1)* 2 ?(0A)*,? (0A1)* : 
?!C* ,(DT1)* 2 ?(0A)*, ? (0A1)* IT5,(0A)* > IAS 
O"C* (Ei)! S704)", 9041)" T3, (OA)* = 2145, 71B* 


AS, in the linear decoration, the right premiss now is a box, we (have to!) reduce 
the decorated cut in linear logic by permuting to the left. This is perfectly allright 
from the point of view of 2-LL, but not from that of S4: were we to perform this 
reduction in 2-LL, then, after two permutations, when deleting all 1-indexed ex- 


ponentials we are likely to find that we left the set of S4-derivations, as the result 
would be 
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T2,0A => A2 
C,UT; > 0A,0^A1 T2,0A> A2,B 
C,DT;,D22 0^1, 42, B 
QC,UT;,D2 > 0A), A2, B 


In order for this to be a correct derivation all formulas in Ta need to be prefixed 
by O, all formulas in Az U (B) by Q. But this we cannot guarantee. 

Precisely this situation and its duals force us to consider decorations obtained 
via sound markings. 


The immediate fruits of our efforts are gathered in the following two corollaries 
to Theorem 4. 


COROLLARY 6 All instances of the cut rule are eliminable from S4-derivations. 


COROLLARY 7 Reductions of S4-derivations n defined as the reflection of the 
normalization in linear logic of decorations x*, corresponding to sound markings 
of n, are strongly normalizing and confluent (cf. [2]). 


3 PROSPECTS 


We had to be sketchy, and the above is far from being a complete ‘linear analysis’ of 
S4-derivations. If, however, it did succeed in bringing across some of the strength 
of linear logic as a proof theoretical tool, it will have served its purpose. 


The same techniques can be applied directly to several modal logics weaker than 
S4. Also, the above results in fact apply to second order extensions of S4 (whatever 
the interest of these might be). Yes, they will indeed apply in all those cases where 
the logic under consideration can be mapped to linear logic by a 
‘decoration-type’ embedding. Hence, however far from universal, the approach 
sketched merits serious consideration as a method, providing e.g. a uniform alter- 
native to several ‘one-shot’ solutions. Exploring its limits should be the subject of 
further research. 


Utrecht University, The Netherlands 
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TOMASZ SKURA 


REFUTATIONS AND PROOFS IN S4 


1 INTRODUCTION 


J. Lukasiewicz introduced the concept of a refutation calculus that axiomatizes the 
set of non-theorems of a logic (cf. [2]). For instance the classical propositional 
logic can be characterized by the following system of this kind: 


Axiom: d L 
Rules: 
4 e(a , ZEN 
(rs) a where e(a) is a substitution instance of a 


(Tmp) rate 8 


Refutation calculi for non-classical logics may be obtained by adding rules of 
the form 


-l 01,...,71 On 
da 


to the above system. The interest of such formulations is mainly theoretical (cf. 
[5, 6]). In particular they provide a method of obtaining decidability results by enu- 
merating non-theorems. 

However it is possible to give more practical refutation rules that are applicable 
to formulas in normal form and that are justified by theorems of the following form 


- aiffforsome1<i<n,Fa; 


where o is a formula in normal form and all œ; are simpler formulas. Using such 

rules for every formula we can construct either a proof or a disproof of it. In fact 

such a system for the intuitionistic propositional logic was given by D. Scott in [4]. 
In this paper we present a system of this kind for the modal logic 54. 


2 PRELIMINARIES 


Let FOR be the set of all formulas generated from the set V AR of propositional 
variables by the connectives A, V, ^, Gl, and the constants T (truth), L (falsity). 
The connectives —, =, © are defined in the usual way. Formulas with no occur- 
rences of O are called modal-free. For any o, 8 € FOR, p € V AR the symbol 
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a(p/) denotes the result of substituting 2 for p in a. We say that o is equivalent 
to B if the equivalence can be shown using theorems of 54, the modus ponens rule, 
and the substitution rule. The symbol 54 denotes also the set of all theorems of S4. 

By an S4algebra we mean an algebra A — (B, I), where B is a Boolean algebra 
and J is an interior operation (cf. [1]). The Boolean operations, the unit element, 
the zero element of A will be denoted by A, V, ~, 1,0 respectively. An element z 
is said to be open iff I(x) = x. The symbol B, will denote the 2-element Boolean 
algebra. For any S4 algebras A), A», A; x A? is the direct product of A; and Ao. 
We also write A” instead of A, x... x A, if Ay =...= A, =A. 


3 RESULTS 


First observe the following simple facts: 


LEMMA 1 Let p € VAR, a € FOR. Then the formula Op — a is equivalent to 
the formula a(p/T). 


LEMMA 2 (cf. [3] Let 8 € FOR. Then B is equivalent to a formula of the form 
A{o(8r; =q;):l1<j<n}AQa>p, 


where allr;,q;,p~pi € VAR, q; £ qk forl < j #k € n, anda is a modal-free 
formula. 


LEMMA 3 Let 8 € FOR. Then Ü is equivalent to a formula of the form 
A(n(ar; =q;):1 <j <n}a N{0(0p:; = qo) : 1 < i < m} A Da > m 


where m > l,allrj,qj,pi,dqo E VAR, q; EZ qk for0< j £ k < n, anda is 
modal-free. 


LEMMA 4 Let A=(B,I) be an S4 algebra and let d be an open element in it. Then 
the function I' : B — B defined thus 


I'() = 


pron if rl (s € B) 


1 otherwise 


is an interior operation. 


Now we prove the following: 
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THEOREM 5 Let o = A(Q(06; 2 7) :1 < j < n} ^A(G(GB; = y): 1 < 
i < m} A Qa, where all 6;,7;, Bi, Yo, œ are modal-free formulas, m > 1. Then 


p> Bı E S4 
iff for some 1 <i < m 

a — Bi V Yo V ... V In € S4, Oô Apa BESA (1€j €n) 
orforsomel<j <n 

p — (Of, = 08;) € 54, y ^ Q(D6; = 4o) > bı € 54. 
Proof. (<=) 


Case 1l. œ — DB; V Yo V... V Yn € S4 and O8; Ap 01€ SA(1 € j <n). 
Then 


p (B6; + fi) € 54 (l<j <n) 
p — (B0; > Of) € $4 (1€j <n) 
p + (OB, = Of;) € S4 
p — (B0; + Ui) € 54 (1€j€n) 
p > (06; ^ Bi) € S4 (1€j €n) 
Also 
p> Bi V WV -V In E S4 
p 3 (yj > 06,) E S4 (<j <n) 
p — (yo — Of;) € S4 
p — (yo > Bi) € S4 
Therefore 
p> pi E S4 
p + Uf; € S4 
yp + Uf € S4 


p — bı E S4 
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Case 2. p + (Dj = 06;) € S4 and p A O(06; = yo) > bı € S4. 
Then 

p — (06; = vo) € S4 

p + O(06; = y0) € S4 

yp pı E S4 


(=) Assume that for every 1 < ? < m 

a — Bi V Yo V ... V4» € S4 or Oô; Ap > Bi d S4 for somel <j <n 
and for every 1 <j <n 

p — (08, = 08;) g S4 or y A O(08; = yo) > bı E S4. 

Now if 


Oô; ^ > Bi g S4 


or 

p — (Od; + Bi) Z S4 
or 

pAO(O6; = yo) > Bi E S4 
then 


p > bı ¢ 54. 
So we assume that for all 1 < i < m 
a —> Bi V Yo V... V In E S4 
and forall 1 <j € n 
pA Uf > 6; d S4. 


It follows that for every 1 € 7 < m there is a valuation v; in B1 such that v;(a) = 1, 
vi(Bi) = vi(y;) = 0 (0 € j € n) and for every 1 € j € n there is a valuation 
w; in some 54 algebra A; such that w; (p) = w;(f1) = 1, w;(6;) Z 1. Hence 
w;(6;) =1(1 <i <m) andw,(O8) =1(1 < j € n), where 8 = Bi A... A Bm. 
Moreover 


v;(Od; ^ OB) = 0 = vi(y;) (l<j<n) 


v;(O 8; ^ Q8) = 0 = vi(vo) (l1<l<m) 
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wj;(H8; ^ 08) = w;(BBi) = w;(e) (1€ i Xm) 


Consider the S4 algebra A = BY” x Aj x ... X An. Let v be the valuation in A 


defined thus: v(p) = (vı (p), ..., vs (p), wı (p), ...,wa(p)) (pe VAR). 
Then 


v(0;) # 1 (l<j <n) 
v(B;) £1 (1€ i € m) 

Also | 
v(Od; ^ OB) = v(y;) (l<j<n) 
v(OB; ^ OB) = v(vo) (l<i<m) 
v(o) 21 


Now using Lemma 4 we define the S4 algebra A' obtained from A by replacing the 
interior operation J of A with the function 7’ defined thus 


l'() = | I(z)^v(08) if «#1 


(x € A) 
1 otherwise 


Let v' be the valuation in A’ defined as follows: v'(p) = v(p) (p E V AR). 
Then 


v'(06;) = v' (7) (l<j<n) 
v'(O8;) = v'(70) (l<i<m) 
v (Oa) = 1 
v'(81) #1 
Therefore v'(y — 01) Z 1, whence v — bı ¢ S4. i 


Next we define the following refutation system for 54: 


Axioms: d o , where a is a modal-free formula that is not a theorem of Classical 
Logic 


Rules: rap, Ts, 
doa fVv4oV..V^n (lL<i<m) doAUD-ó; (1€j3€n) 
QU 


where p = A{O(O6; = yj): 1 < j €&nJAA(G(BB; = yo): 1 € i € m) AGo, 
m > 1, and all a, Di, Yj, Ôj, Yo are modal-free. 
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REMARKS6 I. Ifn = Q then the rule r has the form 


da Bi V Yo (1«€i € m) 


2. From the proof of Theorem 5(=>) it is clear that the refutation rule r is valid 
in S4, i.e. the set FOR — S4 is closed under this rule. Hence the above 
refutation system is really a refutation system for S4, i.e. no theorem of S4 
can be refuted in it. 


Finally we can prove the following completeness theorem for S4. 
THEOREM 7 Let w be a formula of the form p — pı, where p = A{QO(Qr; = 
qj) :1 < j < n}AA{O(Op; = qo) :1 < 1 < m}AQa, m > 1, all pi, qj, Tj, qo € 
VAR, q; Z qk for0 € j x k € n, anda is modal-free. Then either - p or Àj. 


Proof. By induction on the number of variables occurring in v. 


1. w has just one variable. Then y = O(Op; = p1) A Oa — pı. Consider the 
formula a — pı. It is modal-free, so either à — pı or d œ — pi. (We 
assume that every theorem of Classical Logic has a proof.) Hence either F Y 
or 4 y. 


2. w has more variables than one. Then the formulas 
(o > p)(r;/T) 
(o > rji)m/T) 
(p ^ QO(Or; = qo) ^ m)(a; /ao), 


where 1 < 7 € n, have less variables than vy. Also after some simple reduc- 
tions their form becomes that of y. Hence by the inductive hypothesis every 
one of them has either a proof or a disproof. Moreover the above formulas 
are equivalent to 

A;-UÜUrj ^om. 

B; =y ^0pi >rj 


C; =yAQ(Or; = qo) — D 


respectively, so that all A;, B;, C; are either provable or refutable. 
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Also forevery 1 < i < meitherl- D; ord D;,where D; = a > pi;VqoV ...Vqs, 
because each D; is modal-free. 
Now we have the following cases: 


1. 4Di (1€ i € m) andJd Bj (1 <j € n). Thend e > py. 
2. F Di for some 1 <i € mort Bj forsome 1 <j € n. 


If A; or 4 C; then d p — pi, so we assume that - A; and- C; (1€ j < 
n). Hence by the argument in the proof of Theorem 5(«-) we obtain F y — py. 
Therefore either F V or Jd vy. L| 
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PART II 


EXTENDED FORMALISMS 


EWA ORLOWSKA 


RELATIONAL PROOF SYSTEMS FOR MODAL 
LOGICS 


1 INTRODUCTION 


The purpose of this paper is to give a survey of the relational formalization of modal 
logics. The paradigm ‘formulas are relations’ leads to the development of a rela- 
tional logic based on algebras of relations. The logic can be viewed as a generic 
logic for the representation of nonclassical logics; in particular a broad class of 
multimodal logics can be specified within its framework. As a consequence, proof 
systems for the relational logic become a convenient tool for the development of 
a proof theory for nonclassical logics. The relational logic enables us to represent 
within a uniform formalism the three basic components of any propositional logi- 
cal system: syntax, semantics and deduction apparatus. The essential observation, 
leading to a relational formalization of logical systems, is that a standard relational 
structure (a Boolean algebra with a monoid) constitutes a common core of a great 
variety of nonclassical logics. Exhibiting this common core on all the three levels 
of syntax, semantics and deduction, enables us to create a general framework for 
representation, investigation and implementation of nonclassical logics. 

The relational formalization of nonclassical logics is realized on the following 
methodological levels: 


Syntax: With the formal language of a logic L there is associated a language of 
relational terms. 


Semantics and model theory: With logic L there is associated a class of rela- 
tional models for L, and in these models the formulas from L are interpreted as 
relations. Each relational model determines an algebra of relations. The class of 
algebras derived from the underlying class of relational models for L provides a 
relation-algebraic semantics for L. 


Proof theory: With logic L there is associated a relational logic for L such that 
its proof system provides a deduction method for L. 


The semantics of nonclassical logics usually carries on twofold information: 
information about extensional components of the logical system and information 
about its intensional components. Relational formalisms are rich enough to repre- 
sent uniformly these two types of information. Extensional information is modeled 
by means of a Boolean part of algebras of relations and intensional information by 
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means of its monoid part. The crucial element of this construction is a decompo- 
sition of both extensional and intensional elements of a logical system under con- 
sideration into primitive ingredients and next the explication of those of them that 
can be treated as elements of a Boolean algebra or a monoid. 


In the relational representation of logical systems we are going to articulate ex- 
plicitly information about both their syntax and semantics. Generally speaking, 
formulas are represented as terms over algebras of relations. Each of the propo- 
sitional connectives becomes a ‘logical’ relational operation, and in this way an 
original syntactic form of formulas is preserved. Semantic information about a for- 
mula, which normally is included in a satisfiability condition, consists of the two 
basic parts: first, we say which states satisfy the subformulas of the given formula, 
and second, how those states are related to each other via the accessibility relation. 
Those two ingredients of semantic information are of course interrelated and unsep- 
arable. In the relational representation of formulas, terms representing accessibility 
relations are included explicitly in the respective relational terms corresponding to 
the formulas. They become the arguments of the relational operations in a term in 
the same way as the other of its subterms, obtained from subformulas of the given 
formula. In this way semantic information is provided explicitly on the same level 
as syntactic information, and the traditional distinction between syntax and seman- 
tics disappears, in a sense. In the relational term corresponding to a formula both 
syntactic and semantic information about the formula are integrated into a single 
information item. 


One of the advantages of the relational representation of formulas is that we 
gain compositionality. In most of the nonclassical logics their formulas that are 
built with intensional propositional connectives, for example with modal or tem- 
poral operators, are not compositional. The meaning of a compound formula is not 
necessarily a function of meanings of its subformulas. In the relational formalism 
the counterparts of these connectives become compositional. The classical oppo- 
sition between compositional and noncompositional is eliminated when we pass to 
the relational counterpart of a given nonclassical logic. In several applications of 
modal logics there is a need to distinguish between information about static facts 
and dynamic transitions between states in a domain which a given logic is intended 
to model. The relational representation of modal formulas enables us to express an 
interaction of these two components of information in a single, uniform formalism. 
In relational logics an opposition between static and dynamic or between declara- 
tive and procedural is transformed into a coexistence and interaction of these two 
types of information in a uniform framework. 


Relational proof theory enables us to build proof systems for nonclassical log- 
ics in a systematic modular way. First, deduction rules are defined for the common 
relational core of the logics. These rules constitute a core of all the relational proof 
systems. Next, for any particular logic some specific rules are designed and ad- 
joined to the core set of rules. Hence, we need not implement each deduction sys- 
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tem from the scratch, we should only extend the core system with a module corre- 
sponding to a specific part of a logic under consideration. 

From the algebraic perspective the relational formalization of modal logics leads 
to what might be called ‘nonclassical’ algebras of relations. In these algebras the 
‘logical’ relational operations are admitted which are relational counterparts of in- 
tensional propositional connectives. They are not always expressible in terms of 
the standard relational operations. 

The standard semantics of modal logics is usually defined in terms of frames 
([5, 6]), that is relational systems consisting of a set W of states and a family of ac- 
cessibility relations in W. Then the meaning of a formula is defined, first, by means 
of an assignment m of subsets of W to propositional variables, and second, by ex- 
tending m to all formulas of the language under consideration. In this way every 
formula A is interpreted as a subset of states, with the intuition that m(A) con- 
sists of those states in which A is true. The meaning m(A) of formulas built with 
the classical propositional connectives of negation (~), disjunction (V), conjunc- 
tion (A), implication (—), and equivalence («€») is defined from the meanings of the 
subformulas of A by the well known interpretation of these connectives in terms of 
Boolean operations of complement (—), join (U), and meet (N). The meaning m(A) 
of formulas built with intensional operators, such as modal operators of possibility 
and necessity, dynamic logic operators determined by programs ([2, 16]), informa- 
tion operators determined by an indiscernibility or similarity relation ([11, 21]), is 
usually defined in terms of the values of m for the subformulas of A and in terms 
of an accessibility relation, which is most often a binary or ternary relation in W. 

The idea of the relational formalization of nonclassical logics comes from the 
interpretability of logics in a relational logic ([12, 13, 14]). From the logical per- 
spective the interpretability is established by means of a validity preserving trans- 
lation of formulas of logical systems into relational terms. Under that translation 
both formulas, formerly understood as sets of states and accessibility relations, re- 
ceive a uniform representation as relations of the same finite rank, and propositional 
connectives are transformed into logical relational operations. A survey of the re- 
lational semantics for nonclassical logics can be found in [15]. From the algebraic 
perspective the translation leads to the algebras of right ideal relations. 

Let W be a nonempty set. The full algebra of binary relations over W is the 
algebra: 

Re(W) = (Sb(W x W),—,U,0,1,0,;,7 ,1) 


where (Sb(W x W), —,U,/N, 1, 0) is the Boolean algebra of subsets W x W, 1 = 
W x W is the universal relation, 0 is the empty relation, J = ((w, w) : w € W} is 
the identity in W, operations ; and ^! are relational composition and conversion, 
respectively. The class RRA of representable relation algebras consists of isomor- 
phic copies of subalgebras and direct products of full algebras of relations. Every 
algebra from RRA is isomorphic to an algebra whose elements are binary relations, 
1 is an equivalence relation, and J is an identity on the field of 1. If 1 has exactly 
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one equivalence class, then such an algebra is simple. The class RRA is a variety 
(Tarski [19, 20]), but its equational theory is not finitely axiomatizable, and more- 
over, an infinite axiomatization of RRA requires infinitely many relation variables 
(Monk [9, 10]). For any relations P, Q in Re(W) the following conditions are sat- 
isfied: 
PROPOSITION 1 

(ao) Pe Qiff-PuQ-1 

(b) P-Qif(-PuQ)n(-QnP)z1 

(c) P Zliffl;(-P;1-1 

(d4)P=landQ=1iff(PNQ)=1 

(e) P =1lorQ=1iff1;-(1;(-P);1);1UQ =1 

(f) P = 1 implies Q = 1 iff1;(-—P);1UQ =1 

Conditions (a)-(f) show that any open formula of the equational theory of al- 

gebras of relations is expressible as an equation of the form P = 1, where P is a 
relational term. 


In the relational representation of logical formulas the operations of right (/) and 
left (X) residuation play an important role: 


R/P = ((y, z) : (x,y) € P implies (z, z) € R for all x) 
Q \ R= ((z,y) : (y, z) € Q implies (z, z) € R for all z}. 


Residuations are definable in terms of the standard relational operations: 
R/P = -(P^; -R), Q\R=-(-R;Q7'). 


A binary relation P is said to be a (right) ideal relation whenever P;1 = P. 
If P, Q are ideal relations, then so are — P, P U Q, P N Q. Moreover, if Q is an 
ideal relation, then for any P relations P; Q and Q/P are ideal as well. Ideal rela- 
tions can be viewed as unary relations (or sets) which are dummy embedded into 
relations of a higher rank. It follows that the class RRA is a generalized reduct of 
a class of subalgebras of algebras of relations of any rank higher than 2. A com- 
prehensive survey of the theory of algebras of relations can be found in Maddux 


[7]. 


2 RELATIONAL SEMANTICS 


The first step toward a relational formalization of a nonclassical logic L is a trans- 
formation of the language of L into the language of relational terms. In the present 
section we give a number of examples of how formulas from various nonclassi- 
cal logics are represented as relational terms. We define inductively a function t 
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from formulas of a nonclassical logic into relational terms. The function can also 
be interpreted in semantic terms as a meaning function that assigns relations to for- 
mulas, and therefore we often refer to t as to the mapping that provides a relational 
semantics. Assume that every propositional variable p is translated into a relational 
variable t'(p). From the semantic point of view it means that a relation is assigned 
to every propositional variable. Next we define: 


t(p) = t'(p);1 


This condition assures that propositional variables are interpreted as right ideal rela- 
tions. Furthermore, classical propositional connectives are interpreted as Boolean 
operations: 


(5A) = —t(A), t(AV B) = t(A)Ut(B), t(A^ B) = t(A) N t(B) 


The translation of modal formulas built with the possibility operator (A) and ne- 
cessity operator |] determined by a binary accessibility relation R is: 


t((R)A) = R;t(A), t([R]A) = —(R;-t(A)) 


Observe, that the necessity operator can be equivalently expressed by means of 
right residuation: 
«([R]JA) = (4)/R7 


In particular, for dynamic logic modal operators we have: 
t((Ru P)A) = (RUP);t(A) = R;t(A)U P;t(A), 


(UG P)A) = R; P;t(A), t((R")A) = R*;t(A) 


In dynamic logic with specification operators that have been introduced in [4] we 
have in addition: 


((R/P)A) = (R/P);t(A), t(QV IDA) = (Q \ R); t(A) 


The relational semantics for the intuitionistic operations of negation (^;4;) and 
implication (—;5;) is: 


t(A Sine B) = —(R; (t(A) n —t(B))) 


t(-intA) = —(R;t(A)) 


where R is a reflexive and transitive binary accessibility relation such that for all 
propositional variables p we have the following counterpart of the atomic heredity 
condition: 


Ro; t(p) € t(p). 


60 EWA ORLOWSKA 


The relational semantics for the temporal operator Nezt is: 
t(NextA) = (R5t(A)) n -(R; P) 


where R is a reflexive, transitive, and weakly connected accessibility relation. 
Post operations D;, i = 1,2,..., and Post constants E;, 7 = 0,...,w in wt- 
valued logic ((17]) have the following relational semantics: 


t(D,A) = d;t(A), t(E;A) = (In (dj; R);1 


where I is the identity relation, R is an accessibility relation satisfying the same 
conditions as the intuitionistic accessibility relation, and d;,z = 1,2,... are the 
functions that satisfy the following conditions: 


dj;di C di 

ICd,;R' 

I Cdi; R; d; ' 

R C di; R; di 

I Cdi; R iff not I C di44; R^ 

There is 2 = 1,2,... such that I C dj; R 


The above conditions are the counterparts of the respective conditions assumed in 
Post frames ([8]). 

The relational semantics for relevant logics can be found in Orlowska [13]. It is 
easy to see that in all the above examples for any formula A its relational counter- 
part t(A) is a right ideal relation. 

The basic idea behind the relational semantics is that we assign right ideal rela- 
tions to formulas. This interpretation has a natural motivation. To define a concept, 
for example the concept ‘prime number’, we normally say that a prime number is 
a natural number whose divisors are 1 and the number itself. This definition con- 
sists of two parts. The first part ‘natural number’ refers to a broader concept whose 
instances form a universe from which the instances of our concept ‘prime number’ 
come from. The second part of the definition describes the positive instances of 
our concept. The relational semantics of formulas is designed just according to that 
pattern. The domain of the relation assigned to a given formula A consists of the 
positive instances of a concept defined by A, and the range of that relation coincides 
with the admitted universe of objects to which the concept applies. The important 
fact is that under the relational semantics the accessibility relations are ‘taken out’ 
of intensional operators and they are treated in the same way as formulas. This en- 
ables us to formalize multimodal logical systems such that an algebraic structure 
is assumed in the family of accessibility relations. For example, in an extension 
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of dynamic logic obtained by adjoining operations of complement and intersection 
to program constructors ([14]) we have t((—R) A) = —R;t(A) and t((RN P)) = 
(RM P); t(A), and hence the logic can be formalized as a suitable fragment of the 
relational calculus and treated in the relational framework in the same way as the 
standard dynamic logic. 


3 RELATIONAL LOGICS 


In Sections 3 and 4 we show how a relational framework is constructed for propo- 
sitional nonclassical logics. Given a propositional logic L, whose frames consist 
of a set W and an accessibility relation in this set or a family of accessibility re- 
lations, we define a relational logic Rel L for L. Let V AR be a set of individual 
variables and let CON, be a set of relational constants representing accessibility 
relations from frames of L. For example, if L is a modal or standard temporal logic, 
then CON, = {R} consists of a single constant R. If L is a dynamic logic, then 
CON, consists of atomic programs. The set EX PREL, of relational expres- 
sions is generated from V ARRELU {1, I} U CON, with the standard relational 
operations (—,U,0,; , /, V, 7} ) and possibly with new relational operations which 
might not be definable in terms of the standard ones, that correspond to some of 
the intensional propositional operations of L. Formulas of the relational logic are 
of the form xAy, where x,y € VAR and A is a relational expression from set 
EX PREL. If in frames for L an algebraic structure is assumed in the universes 
of those frames, then we define a set TM Sz of terms generated by V AR with all 
the underlying algebraic operations. For example, if in models of a temporal logic 
a unary successor operation s is admitted in the set of moments of time, then the 
respective set TM Sz contains terms of the form s(x) for all x € TM SL. Next, 
formulas of the relational logic for L are defined as expressions of the form x Ay, 
where z, y € TMS, and A € EX PREL,. Semantical structures for relational 
logics of the form Rel L are models of the form: 


M = (W,m) 


where W is a nonempty set and meaning function m : VARRELU{1, I}UCON, 
—> Sb(W xW) assigns binary relations in W to relational variables and constants, 
in such a way that 1, J, and constants from CON, receive their intended interpre- 
tation as the universal relation, identity and the accessibility relations from models 
of L, respectively, and the relation variables are mapped into right ideal relations. 
Function m extends to all the relational expressions from E X PRE Lz in a homo- 
morphic way. Resuming, the meaning function in relational models is defined as 
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follows: 
m(P) C W and m(P); m(1) = m(P), for P € VARREL 
m(R) C W x W forevery R € CON, 
m(1) - W xW, m(I) = {(w,w): we W} 
m(~A) =—m(A), m(A~*) = m(A)* 
m(AU B) 2 m(A)Um(B), m(An B) =m(A)Nm(B) 
m(A; B) = m(A); m(B) 


and, moreover, if a relation represented by R € CON satisfies some constraints 
in logic L, then we assume that in the models of the relational logic for L relation 
m(R) satisfies the same constraints. For example, in temporal logic of linear time 
we assume that in every model relation m( R) is weakly connected. 

By a valuation in M we mean a function v : VAR —»5 W assigning elements 
of W to the individual variables. A relational formula x Ay is satisfied by v in M 
whenever the following holds: 


M,v sat zx Ay iff (v(x),v(y)) € m(A). 


Formula xAy is true in M iff M,v sat xAy for all the valuations v in M, and 
x Ay is valid iff it is true in all models. In other words, formula z Ay is true in a 
model M whenever m(A) = m(1) holds in M. Relational logics Rel L defined 
according to the above scheme and such that set CON, is nonempty, or the set of 
relational operations includes some nonstandard operations, are called nonclassi- 
cal relational logics. The classical relational logic is the logic with formulas x Ay 
such that A is a term built with the standard relational operations as defined in full 
algebras of relations. The semantics of relational logics defined above provides an 
interpretation of relational variables as right ideal relations. The respective models 
determine classes of algebras of relations generated by right ideal relations. These 
models are referred to as RJ-models, and the corresponding relational semantics is 
called RJ-semantics. 

A semantic relationship between a nonclassical logic L and the relational logic 
RelL for L is established in the lemmas of the following scheme. 


PROPOSITION 2 For every model of L of the form M = (W, family of relations, 
m), there is a model M' = (W,m’) of the relational logic for L such that for any 
formula A of L and for any w € W we have: 


(i) M,w sat A iff(w,z) € m'(t(A)) forall z € W. 


Proof. We define model M' as follows. Its universe coincides with the universe W 
of M. If P € VARREL and P = t'(p) fora propositional variable p, then we put 
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m'(P) = m(p) x W. If R € CON, then we put m'(R) = R, that is the meaning 
of constant R in the relational model is the relation from model M denoted by that 
constant, as usual we use the same symbols for both of them. The proof of the re- 
quired condition is by induction with respect to the complexity of A. We show the 
induction step for a modal logic L and for a formula of the form (R)A. We have 
M,w sat (R)À iff there is t € W such that (w,t) € Rand M,t sat A. By the in- 
duction hypothesis there is t such that (w, t) € m'(R) and (t, z) € m'(t(A)) for all 
z € W, which yields (w, z) € m'(R); m'(t(A)) = m'(R;t(A)) = m'(t((R)A)). 

a 


PROPOSITION 3 For every model M' = (W,m’) of the relational logic for L 
there is a model M of L such that condition (i) is satisfied. 


Proof. We define model M as follows. Its universe coincides with the universe W 
of M'. Accessibility relations in M are all the relations m'(R)forR. € CON ,. 
For any propositional variable p we put m(p) = domain of m'( P) where P = t'(p). 
By induction on the complexity of a formula A one can show that condition (i) is 
satisfied. B 


PROPOSITION 4 (a) A formula A of logic L is valid in L iff xt(A)y is valid in 
Rel L.(b) Formulas A1,..., An imply a formula B in L iff 1; —C(t(A1) N... N 
t(An));1Ut(B)y is valid in RelL. 


Proof. To prove (a) assume that a formula A is valid in logic L. Suppose that there 
is a model M’ = (W',m/) of the relational logic for L and there are z, y € W' 
such that (z, y) ¢ m'(t(A)). By Proposition 3 there is a model M of L with the 
universe W' such that not M,z sat A, which contradicts the assumption. Now 
assume that z£( A)y is valid in the relational logic for L and suppose that A is not 
valid in L. Hence there is a model M = (W, family of accessibility relations, m) 
of L and there is w € Wsuch that not M,w sat A. By Proposition 2 there is a 
model M’ of the relational logic for L with the universe W, and there is z € W 
such that (w, z) ¢ m'(t(A)), which contradicts the assumption. We prove condition 
(b) for n = 1. Assume that for every model M of L, if A is true in M, then B is 
true in M. Suppose that there is a model M' = (W, m) of logic Rel L such that 
m(1; —t(A); 1Ut(B)) zz m(1). By 1 (f) from Section 1 it yields m(t(A)) = m(1) 
and m(t(B)) z m(1). By Proposition 2 there is a model M of L having the same 
universe as M' such that A is true in M and B is not true in M, a contradiction. 
Now assume that z1; —£(A); 1 Ut(B)y is valid in RelL. Suppose that A does not 
imply B in L, that is, there is a model M of L such that A is true in M but B is not 
true in M. By Proposition 2 there is a model M’ = (W, m’) of RelL with the same 
universe as the universe of M such that m'(t(A)) = m'(1) and m'(t(B)) 4 m'(1). 
By 1 (f) we obtain a contradiction which completes the proof. a 
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We conclude that, given a nonclassical logic L, a deduction system of the rela- 
tional logic RelL can serve as a theorem prover for L. In the following sections 
relational proof systems will be outlined for some nonclassical logics. 


4 PROOF SYSTEM FOR THE CLASSICAL RELATIONAL LOGIC 


Proof systems for relational logics are Rasiowa-Sikorski style systems ([18]). In 
the present section we recall the deduction rules for the classical relational logic 
([12]), that is the logic whose formulas x Ay are built from terms A generated by 
V ARRELU({1, I} with the standard relational operations. The rules apply to finite 
sequences of relational formulas. There are two groups of rules: decomposition 
rules and specific rules. Decomposition rules enable us to decompose formulas in 
a sequence into some simpler formulas. Decomposition depends on relational op- 
erations occurring in a formula. As a result of decomposition we usually obtain 
finitely many new sequences of formulas, and sometime, as in the case of opera- 
tion —* in dynamic logic, infinitely many sequences. The specific rules enable us to 
modify a sequence to which they are applied, they have a status of structural rules. 
The role of axioms is played by what is called fundamental sequences. In what fol- 
lows K and H denote finite, possibly empty, sequences of formulas of a relational 
logic. A variable is said to be restricted in a rule whenever it does not appear in any 
formula of the upper sequence in that rule. 


(DEC) Decomposition rules for the standard relational operations: 


(U) K,zAU By, H (—U) K,x-(AUB)y,H 
K,xAy,xBy, H K,x— Ay,H K,- By, H 

(N) K,xAN By, H (—/) K,x — (AN B)y, H 

K,xAy,H K,xBy,H K,x — Ay,x — By, H 


(--) K,r-— —Ay, H 
K,xAy,H 


(1) K,zA ly, H (—~*) K,z -(A y H 
K,yAz, H K,y — Az, H 
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G) K,zA;By, H 
K,rAz,H,xA;By K,zBy,H,rA;By 
z is a variable 
(—;) K,z — (A; B)y, H 
K,<x — Az, z — By, H 


z is a restricted variable 


(\) K,2A\ By, H 
K,y — Az,xBz,H 
z is a restricted variable 
(-\) K,z - (AVBy,H 
K,yAz,H,c-(A\B)y K,x— Bz H,x -(ANMB)y 
z is a variable 
(/) K,zA[By, H 
K,z — Bx,zAy,H 
zisarestricted variable 
(-/) K,x—(A/B)y, H 
K,zBz,H,r—(A/JBy K,z—AyH,x-—(A/B)y 


z is a variable 


(SPE) Specific rules: 
(11) K,xAy, H 
K,xIz,xAy,H K,zAy,xAy, H 
z is a variable, A c VARREL 
(12) K,zAy, H 


K,rAz,rAyH K,zlIy,rzAy,H 
zisa variable, A c VARREL 
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(I3) K, aly, H 
K, yIx, H, rly 
(I4) K,xIy, H 


K,xIz,H,xIy K,zly,H,xly 


z is a variable 


(ideal) K,xAy, H 
K,xAz, H, xAy 
z is a variable, A c VARREL 
(FND) Fundamental sequences: 


A sequence of formulas is said to be fundamental whenever it contains a subse- 
quence of either of the following forms: 
(f1) zAy, x2 — Ay where A is a relational expression 
(f2) rly 
(f3) xix 


A sequence K of relational formulas is valid iff for every model M of the rela- 
tional logic and for every valuation v over M there is a formula in K which is satis- 
fied by v in M. It follows that sequences of formulas are interpreted as (metalevel) 
disjunctions of their elements. A relational rule of the form K/(H, : t € T} is 
admissible whenever sequence K is valid iff for all t € T sequence H; is valid. It 
is easy to see that the fundamental sequences are valid and all the rules given above 
are admissible. 


PROPOSITION 5 (a) All the rules in (DEC) and (SPE) are admissible. 
(b) All the sequences in (FND) are valid. 


Admissibility of the decomposition rules follows from the definitions of the respec- 
tive relational operations, and admissibility of the specific rules follows from the 
properties of the relational constants or (in the case of rule (ideal)) variables re- 
flected by those rules. For example, rules (71) and (72) are admissible because we 
have I; A = A = A; I for any relation A. Rules (13) and (14) are admissible be- 
cause of symmetry and transitivity of I, respectively. Rule (ideal) is admissible 
because every relational variable is interpreted as a right ideal relation. 

Relational proofs have the form of trees. Given a relational formula x Ay, where 
A might be a compound relational expression, we successively apply decomposi- 
tion or specific rules. In this way we form a tree whose root consists of zAy and 
whose nodes consist of finite sequences of relational formulas. We stop applying 
rules to formulas in a node after obtaining a fundamental sequence, or when none of 
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the rules is applicable to the formulas in this node. A branch of a proof tree is said 
to be closed whenever it contains a node with a fundamental sequence of formulas. 
A tree is closed iff all of its branches are closed. 

In the following a completeness theorem is given for the classical relational logic 
with respect to the class of its RJ-models. 


PROPOSITION 6 (Completeness theorem) A relational formula x Ay is 
valid iff there is a closed proof tree with the root x Ay. 


Proof. (=>) Suppose that there is no closed proof tree for z Ay and consider a tree 
satisfying the following conditions for every non-closed branch b. We write G € b 
whenever formula G is a member of a sequence of formulas in a certain node of b. 
(bl)zAy€b 
(b2) If x(B U C)y (x — (B N C)y) € b then both zBy (x — By) € b and zCy 
(x — Cy) € b obtained by application of rule (U) (resp. (—/N)) 
(b3) If x — (BU C)y (z(B N C)y) € b then either x — By (zBy) € bor x — Cy 
(xCy) € b obtained by application of rule (—U) (resp. ((1)) 
(b4) If z(B; C)y € b then for every z € VAR eitherzBz € bor zCy € b obtained 
by application of rule (; ) 
(b5) If x — (B; C)y € b then for some z E VAR both x — Bz E€ bandz—Cy € b 
obtained by application of rule (—; ) 
(b6) If x — —By € b then xBy € b obtained by application of rule (——) 
The similar closure conditions are postulated for formulas built with operations of 
conversion and residuations. 
(b7) If zBy € b with Be VARREL then for every z € VAR either zIz € bor 
z By € b obtained by application of rule (11) 
(b8) If zBy € b with B € VARREL then for every z e VAR either zBz € bor 
zly € b obtained by application of rule (72) 
(b9) If zIy € b, then yIx € b obtained by application of rule (13) 
(b10) If zIy € b, then for every z € VAR either zIz € bor zIy € b obtained by 
application of rule (14) 
(b11)If zBy € bwith B € VARREL then for every z c VAR wehavexBz € b 
obtained by application of rule (ideal). 
Any tree satisfying conditions (b1), ... , (b11) is referred to as a complete proof 
tree. The standard proof-theoretic construction shows that for every formula there 
is a complete proof tree with this formula in a root. 

Let b be a non-closed branch of a complete proof tree. We define system: 


M? = (W*, m) such that 
W*-VAR 
m*(P) = {(2,y) € W? x W? : zPy d b} for P € VARRELU {1,1} 


We extend m? in a homomorphic way to all the relational expressions. 
Observe that: 
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(i) m^(1) is an equivalence relation in set W^. 

For suppose that for some x we have (z,x) ¢ m*(I). It follows that zIx € b, 
and then branch b would be closed, a contradiction. Now let (x,y) € m*(I), and 
hence rly £ b. If (y, x) g m? (I), then yIz € b, and by (b9) we have zIy € b, a 
contradiction. Similarly, (b10) enables us to prove transitivity of J. 

(ii) m^(I); m? (B) = m? (B) = m? (B); m? (I) for any relational expression B. 

If B is a relational variable, then the condition holds by (b7) and (b8). For the com- 
pound relational expressions the proof is by induction with respect to the complex- 
ity of the expression. 

(iii) m? (1) is the universal relation on W^. 

For otherwise zly € b for some x, y which would make branch b closed. 

(iv) m*(P) is an ideal relation for any P € VARREL. 

This condition follows from (b11). 

Let v? be a valuation in M* such that v? (x) = x for every object variable z. 

We say that formula z By is indecomposable whenever B € VARREL Uù {1, I}. 
Let IN D* be the set of all the indecomposable formulas occurring in the nodes of 
branch b. From the definition of m* we have: 

(v) For every zBt € IN D* we have not M*,v^ sat zBt. 

We define an ordering of relational terms: 

if P is a relational variable then ord(P) = ord(1) = ord(I) = 1 

if ord(B) = n then for any unary relational operation * we define ord(* B) = n+1 
if ord(B) € n and ord(C) < n and at least one of the inequalities is =, then for 
every binary relational operation [f| we define ord( BC) =n +1. 

We will show that: 

(vi) not M*,v* sat x Ay. 

For suppose conversely, and let X? be the set of formulas z Bt on b such that M? , v^ 
sat z Bt. X" is nonempty since z Ay is its member. Let C be a term of a minimal 
order such that uC'w is in X* for some variables u, w. We show that C must be ei- 
ther a relational variable or 1 or J. C cannot be of the form u — Pw for a relational 
variable P, for otherwise we would have u — Pw € b and M*,v sat u — Pw, 
and from the definition of m? the latter is equivalent to uPw € b. A similar ar- 
gument shows that C is neither of the form —1 nor —J. Suppose that C is of the 
form C4; C5. Hence conditions (al) uC1; Cow € band (a2) M*,v^ sat uC4; Cow 
hold. From (a1) and (b4) we conclude that for all z either uC z € b or zC3w € b. 
From (a2) we have that there is t such that M^, v? sat uC1t and M? v? sat tCw. 
Hence either uC,t € X? or tCow € X°, and C1, C» have a smaller value of ord 
than C, a contradiction. In a similar way we show that C is neither an expression 
built with any other relational operator nor a complemented compound expression. 
In view of the above, uCw € IN D^, and hence by (v) we have not M*, v? sat 
uC wu, a contradiction that completes the proof of (vi). 

To complete the proof of the (=) part of the theorem we define a quotient structure: 
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M? = (W1,m4) such that 

W3 = (/z/ : z € V AR) and /z/ is the equivalence class of m*(1) 
generated by z, 

m* (P) = ((/z/,[y/) € W* x W? : (x,y) € m*(P)) for P € VARREL, 


and mn? is extended in a homomorphic way to all the relational expressions. It fol- 
lows from (ii) that the definition of m? (P) is correct, that is, it does not depend 
on the choice of representatives from the equivalence classes. It is easy to see that 
in M? the relation m?(I) is the indentity in W?, since we have /x/ = /y/ iff 
(x,y) € m*(1) iff (/z/, /y/) € m*(I). Moreover, a standard inductive argument 
shows that we have (/z/,/y/) € m*(B) iff (x,y) € m*(B) for any relational 
expression B. It follows that M? is a model of the relational logic such that x Ay 
is not true in M?, a contradiction. The proof of part (<=) of the theorem is based 
on Proposition 5. a 


Clearly, a similar completeness theorem holds for a relational logic whose lan- 
guage coincides with the language of the classical relational logic and whose se- 
mantics is less restrictive, that is the meaning function in the respective models as- 
signs arbitrary binary relations to relational variables, not necessarily right ideal 
relations. The deduction system for this logic can be obtained from the deduction 
system defined above by deleting rule (ideal). 


EXAMPLE 7 We show that in the classical relational logic with RJ-semantics for- 
mula z(A U —(A; 1))y is valid. 


z(AU —(A;1))y 
(U) 
zAy, x — (A:l)y 
(—;) restricted variable :— z 
rÁy,r-— Az, z — ly 
(ideal) new variable := z 
rÁz,r— Áz, z— ly 


fundamental 


EXAMPLE 8 We show that —(A \ B) = (—B \ —A) \ —I holds for any binary 
relations A, B. In view of fact 1 (b) it is sufficient to show that (LAM B) U ((—BW 
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-AJV-I)n(-(AVB)Uu-((-BN-A)V-1D) = 1: 


a((A \ B)U((-B\ —4A) \ -1)) N(-(A\ B) U -((-B \ -A) \ -)y 
(N) 
z(A \ B) U ((-B \ —A) \ -Dy z — (A \ B) U -((-B \ ~A) \ -I)y 
(Y) (Y) 
zA \ By, z(—B \ —A)\ -Iy z-(AVByy, F= z- ((-B \ —A) \ -I)y 


(\) twice (—\) new variable := x 
new variables := z,t x — (A \ B)y, y-BM-Az, F ...,ulz, ... 
y — Az, xBz, y — (—-B\—A)t, x— It (X) new variable := z fund. 
(—\) new variable := z x — (A \ B)y, xBz, y — Az 
y— Az, xBz y-—Az,xrBz (—\) new variable := z 
t— Bz, x—It yAz,x—It yAz,... x— Bz,... 
(11) to zBz fund. fund. fund. 
new variable :— t 
.. ,ait,... ..., tEBz, ... 

fund. fund. 


5 PROOF SYSTEMS FOR NONCLASSICAL RELATIONAL LOGICS 


Relational proof systems are fully modular. Given a reduct of the relational lan- 
guage, we obtain a deduction system for it by restricting the set of rules and the 
set of fundamental sequences to those which refer to the respective operators and 
constants from this reduct. For example, in the relational logic corresponding to 
modal logic K we do not need constant J, and hence we do not need the rules and 
fundamental sequences referring to it. Similarly, given an extension of the basic 
relational language, to obtain a deduction system for the underlying logic, we have 
to define decomposition rules for all the new operators and their complements, and 
the specific rules reflecting the properties of the new constants. 

Let CT be a set of constraints which a binary relation is supposed to satisfy. We 
do not specify here any particular language to express these constraints, it can be for 
example a first order language or a language of relational terms. For a 
constraint C € CT and a relation R, we shall write C(R) to denote a sentence 
obtained from formula C by interpreting the relational constant in C as R. Let 
REL(CT) be the class of relations satisfying constraints from CT: 


REL(CT) = (R € Re(W) : W isaset, and C(R) is true for every C € CT}. 


Let a nonclassical logic L(CT) be given such that the accessibility relation from 
its models belongs to REL(CT). The formulas of relational logic Rel L(CT) for 
L(CT) are built from terms generated by VARREL U {1, I, R}, where R is a 
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constant representing the accessibility relation, with the standard relational opera- 
tions, and possibly with some nonclassical relational operations. We have to pro- 
vide in the language of Rel L (CT) a relational counterpart of every propositional 
operation from the language of logic L(CT'). Models of this relational language 
are systems M = (W,m) such that meaning function m satisfies the conditions 
given in Section 3, and moreover m( R) € REL(CT). The relational proof sys- 
tem for Rel L(CT) consists of the rules and fundamental sequences given in Sec- 
tion 4, decomposition rules for all the nonclassical relational operations admitted 
in the language of Rel L(CT), and specific rules for the accessibility relation, re- 
flecting the fact that it belongs to REL(CT). An example of decomposition rules 
for nonclassical relational operations is given below. 


Decomposition rules for the temporal operators Until and Since: 


(Until) 
K, zAUntil By, H 
K, zRt, H, xXAUntilBy K, tBy, H, zAUntilBy K,xz— Ru, u — Rt, uAy, H, 
xA Until By 
t is a variable, 


u is a restricted variable 


(-Until) 
K, x — (AUntil By, H 
Hi H3 Hs 
where Hı = K, x — Rt, t — By, x Ru, H,x — (AUntil B)y 
Ho = K, x — Rt, t — By, uRt, H, x - (AUntil B)y 
Ha = K, x — Rt, t — By, u — Ay, H,x — (AUntilB)y 


u is a variable, t is a restricted variable 


(Since) 
K, xAS1nceBy 
K, tRz, H, xASinceBy K, tBy, H, xASinceBy K,t— Ru, u — Rz, uAy, H, 
xzASinceBy 
t is a variable, 


u is arestricted variable 


-Si 
(Since) K, xz — (ASinceB)y, H 


Hı Hə H3 
where Hı = K, t — Ra, t — By, tRu, H, x — (ASinceB)y 
Hə = K, t — Rz, t — By, uRz, H, x — (ASinceB)y 
H; = K, t — Rz, t — By, u — Ay, H, z — (ASinceB)y 


u is a variable, t is a restricted variable 
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Let r(R) denote a relational rule of the form K/(H, : t € T} in which rela- 
tional constant R occurs, and let RL(R) be a set of these rules. They are called 
specific rules for R. In a way similar to that developed in Section 4 we define the 
admissibility of a rule in a nonclassical relational logic. Sequence K of relational 
formulas is valid in Rel L(CT ) iff for every model M of Rel L(CT) and every val- 
uation v in M there is a formula F in K such that M, v sat F'. We say that r(R) is 
admissible in Rel.L(CT ) whenever sequence K is valid in Rel L(CT') iff for every 
t € T sequence H; is valid in Rel L(CT). We say that: 

A set RL(R) of rules defines class REL(CT) of relations whenever R € 
REL(CT) iff every rule from RL(R) is admissible. 


As usual, we use the same symbols for relations and the respective constants. 


PROPOSITION 9 If sets RLA(R), RL2(R) of specific rules for R define classes 
REL(CT|), REL(CT3) of relations, respectively, then class REL(CT, U CT») 
is definable by RLA(R) U RL2(R). 


In the following we give an example of a class of relations definable by means of 
a rule. A relation R is euclidean if it satisfies: 

For all x, y, z if (x,y) € Rand (z,z) € R, then (y, z) E€ R. 
The corresponding relational rule is: (euc R) 


K, zRy, H 
K, zRz, H, cRy K, zRy, H, zRy zisa variable 


PROPOSITION 10 Rule (euc R) defines REL( Euclidean). 


Proof. Assume that rule (euc R) is admissible, and suppose that there are a, b, c 
such that (a, b) € R, (a,c) € Rand (b,c) ¢ R. Consider an instance of rule (euc 
R) such that K = z — Rz, z — Ry and H is empty. It follows that sequence S = 
xRy, z— Rz, z — Ry is valid. However, a valuation v such that v(x) = b, u(y) = c, 
and v(z) = a satisfies none of the formulas in S, a contradiction. Now assume that 
Re REL(Euclidean). If the upper sequence S = K, xRy, H in rule (euc R) is 
valid, then so are the lower sequences Sı = K, zRz, H, xRy and $5 = K, z Hy, 
H, x Ry. Now assume that for every model M and every valuation v in M there 
are formulas Fj in Sı and F in 55 such that M,v sat F1 and M,v sat Fo. If 
Fi = zRz and F} = zRy, then since R is euclidean, we have M,v sat «Ry. In 
all the remaining cases F} or F> occurs in S. We conclude that S is valid. a 


EXAMPLE 11 We give arelational proof of the modal formula (R) A > [R](R)A 
which is valid in the class of Kripke models with euclidean accessibility relations. 
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To simplify notation we assume that t( A) = A. 


z(-(R; A) U (R; A)/ R^ !)y 
(U) 
x — (R; A)y, z(R; A/ Ry 
(—;) new variable :— z 

r— Rz, z — Ay, xz(R; A)/R^ ly 

(/) new variable; := v(— ^!) 
x — Rz, z — Ay, x — Rv, vR; Ay 

(;) new variable := z 
x — Rz, z — Ay, x — Rv, vRz, vR; Ay z Ay, ... 


(euc R) new variable := x fundamental 
rRv,..  mzxRz,.. 
fund. fund. 


Observe, that defining the formalization of modal logics within the framework of 
relational logics with RJ-models we can use a simpler version of translation of 
propositional variables: t(p) — t'(p), the property of being right ideal relation is 
guaranteed directly in the RJ-semantics. 


6 RESTRICTED CUT RULES 


In relational proof systems the cut rule has the following form: 


(cut) 
K 
K, xAy K,r-— Ay x,y arbitrary variables 


Admissibility of this rule follows from the fact that a sequence K of formulas is 
valid iff the sequence K, xA N —Ay is valid. In other words, we can extend any 
sequence of formulas in a node of a proof tree with an unsatisfiable formula. 


In relational proof systems for modal logics we often admit a restricted cut rule 
such that the respective unsatisfiable formula is of a particular form. For example, 
in the proof system for the logic with a serial accessibility relation ([2]) we have 
the following restricted cut rule: 


(serial) 


K 


K, x— Ry zisa variable, y is a restricted variable. 
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Clearly, if in every model the relation R is serial, then the rule (serial) is admissible. 
In the completeness proof of the system with this rule the following condition is 
added in the definition of a complete proof tree: 
(b12) For every z there is a y such that x — Ry € b obtained by application of the 
rule (serial). 

Next, in the definition of system M? we extend the meaning function m? by 
putting: 

m? (R) = ((z,y) : zRy d b) 


Condition (b12) enables us to prove that m°(R) is serial. Furthermore, we extend 
the family of indecomposable formulas postulating that z Ry is an indecomposable 
formula, and then the proof can be completed as in Section 4. 


7 INFINITARY RULES 
Infinitary rules in relational proof systems have the form: 
K/{Hi}iew 


A rule of that form is admissible whenever sequence K is valid iff for every 2 € w 
sequence H; is valid. An example of a relational proof system with an infinitary 
rule is the system for dynamic logic ([14]). The rules for the iteration operator * 
are as follows: 
(*) 
K, cA*y, H 
K, zAty, H,xA*y where i € w, A? = I, A't! = A; A' 


(—*) 
K, x — (A*)y, H 
{K, T — (A*)y, H kie. 


The definition of a complete proof tree is extended with the following condi- 
tions: 
(b12) If zB*y € b, then for every i € w, zB*y € b obtained by application of rule 
C) 
(b13) If z — B*y € b, then for some i € w, z — (B*)y € b obtained by application 
of rule (—*). 

Moreover, we suitably modify the notion of ordering < of relational terms: 
(<1) Relational variables and constants are minimal with respect to < 
(<2) A < -A, A < A`! 
(<3) A < AtB and B < AłB fort = U, A, ; 
(<4) A’ < A* forall i Ew 
(<5) A < B implies - À < -B 
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It follows that -A < —(A{B),-B < —(A{B), and —A* < —A*. Ordering 
< is well founded, and in the proof of the completeness theorem analogous to 6 
condition (v) is shown by induction with respect to <. 

Other examples of infinitary rules in relational proof systems can be found in 


[1]. 


EXAMPLE 12 We give a relational proof of the induction axiom from dynamic 
logic. Let F = p ^ [R*](p > [R]p) ^ [R*]p. In the relational representation of 
this formula we use the right residuation operator / for the translation of the neces- 
sity operation. For the sake of simplicity we assume that t(p) = p. We have t(F) 
= —pU R*; (pN (R; -p)) Up/R**. 


zt(F)y 
(U) 
z — py, zR' (pN (R; —5))y, zp/ R^! y 
(/) (-)) 


new variable := z 
z — py, tR*;(pN(R;—p))y, x — R'z, zpy 
G) 


new variable := x 


r—py,r— Re*, zpy, cR*2, r—py,z-— R'z, zpy,xzpN (R; —p)y, 


a R*; (pN (R;—p))y tR*; (pN (R; —p))y 
(*)i=0 (N) 
ea 212, ... 4 EPY, » r—py,r— R*z, zpy, 


fundamental fundamental x(R;—p)y, cR'; (ph (R; —p))y 
G) 


new variable := z 
x — py, x — R* z, zpy, veep Z — PY, we 
zRz, £R*; (pN (R;—p))y, fundamental 
z(R; —p)y 
(—") 
Ko Ky... Kj... 


where K;(z) = x — py, x — R'z, zpy, zRz, cR*; (pN (R; —p))y, z(R; —p)y and 
z is any variable distinct from z and y.We show by induction that for any natural 
number z and for any z # x, y there is a closed tree starting from K;(z).For i = 0 
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we have: 


Ko = {a — py, x — Iz, zpy, zRz, £R*; (pN (R; —p))y, z(R; —p)y} 
(11) to zpy 
new variable := z 
Ko, xlz Ko,xpy 


fundamental fundamental 


For i = 1 sequence K contains z — Rz and z Rz, and hence it is fundamental. 
For 2 = n we have: 
x — py, x — (R^-!; R)z, zpy, zRz, £R*; (pN (R; —p))y, z(R; —p)y 
(—;) new variable := t 
x — py, x — R^ t, t— Rz, zpy, zx Rz, sR*; (ph (R; —p))y, 2(R; —p)y 
(;) to G = 2R*;(pN(R; —p))y 


new variable :— t 


SQZR*'t,..  x— py, x— R^ lt, t - Rz, zpy, zRz, t(pN (R; —p))y, z(R; —p)y G 
(*)i2n-1 (N) 
„o PRE, xz — py, z — R”`}t, x — py, z — R"^-|t, 
fundamental t — Rz, zpy, zRz, tpy, t — Rz, zpy, zRz, 
G, z(R; -p)y t(R; —p)y, G, z(E; —p)y 
(:) to z(R; —p)y (;) to t(R; —p)y 
new variable :— t new variable :— z 
{x — py, x — R"-!t, t— Rz, ...,tpy,... .,tRz,.. 1,2 —py,.. 
zpy, zRz, tpy, G, x Rt, fund. fund. fund. 


z(R; —p)y} = S 


Sequence S includes sequence K,,_1(t) and t # z, y. Hence by the induction hy- 
pothesis there is a closed tree starting from S. We conclude that there is a closed 


proof tree for the induction axiom. 


8 CONCLUDING REMARKS 


In the paper a method has been presented of transformation of nonclassical proposi- 
tional logics into a relational logic. Our main concern has been in the development 
of relational proof systems for nonclassical logics. The basic steps leading to the 


construction of a relational proof system for a given logic L are: 


Translate formulas of L into relational terms, introducing new constants and/or 


relational operations if needed. 
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Define the rules for these new constants and operators, and adjoin them to the 
proof system of the standard relational logic. 


Relational proof systems consist of rules of the following types: 

' Decomposition rules: a compound relational term is replaced by its component 

terms. These rules are counterparts of operational rules in Gentzen style systems. 
Specific rules: they reflect properties of relational constants. They are counter- 

parts of structural Gentzen rules. Some of the specific rules have the form of: 

Restricted cut rules: a cut formula has a particular fixed form. 


A computational framework for relational proof systems is being creating within 
the ATINF inference system. The description of a module of graphical presenta- 
tion of relational proofs can be found in [3]. 
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NUEL BELNAP 


THE DISPLAY PROBLEM 


A Gentzen calculus has the “display property’ if every antecedent [consequent] con- 
stituent can be displayed as the antecedent [consequent] standing alone. It is ex- 
plained why this property is interesting. The ‘display problem’ is the problem of 
designing a calculus with the display property. A perspective is suggested from 
which the solution of Wansing [19] can easily be seen to be incomparable with that 
of [4]. The perspective suggests some other solutions, which are briefly surveyed. 
Additional questions are raised. 


1 THE DISPLAY PROPERTY 


Display logic is a refinement of the sequent calculuses of Gentzen [11]—which for 
reasons given in Anderson and Belnap [2] we call ‘consecution calculuses'!. These 
calculuses concern proof-theoretical implication statements expressed by Gentzen 
with the arrow and by Curry [6] with the turnstile ‘+°. Our usage follows Curry. 
Thus, the consecution 


XFY 


is taken as expressing the fundamental proof-theoretical implicative idea. In Gent- 
zen, always X and sometimes Y can be sequences of constituents separated by 
commas. We call those in X ‘antecedent’ constituents, and those in Y ‘conse- 
quent’ constituents. Later this simple situation will be complicated by the pres- 
ence of nested structuring, and one should then adjust the terminology so that e.g. 
antecedent constituents of antecedent constituents are themselves consequent con- 
stituents. In thinking this through it is sometimes helpful mentally to replace ‘an- 
tecedent/consequent’ by 'negative/positive'. 

Central to display logic is the ‘display property’. The property is in a way techni- 
cal and accordingly difficult to state with a combination of precision and generality. 
We settle for an informal statement. 


1 There is no hope of exporting our usage; but it is so clear and understandable that there seems no: 
reason why we ourselves should not consistently use it. Besides, there ought to be a controversy con- 
cerning whether ‘consecution’ contains ‘cut’ as a well-formed part. 
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THE DISPLAY PROPERTY 


The idea is that if a constituent appears structured as a ‘negative’ or 
‘antecedent’ constituent of a proof-theoretical statement, then there is 
a tightly equivalent way of expressing exactly the same thing in which 
that constituent is ‘displayed’ as the antecedent standing alone. And 
similarly for each constituent that appears structured as a ‘positive’ 
or ‘consequent’ constituent. ‘Tightly equivalent’ is to mean at least 
‘interderivable by the cut-free rules’, not merely ‘equi-provable using 
cut’.? 


The systems of Gentzen do not have the display property. For example, start 
with 


A, B C, 


where A, B, and C are formulas in Gentzen’s formulation of (even) classical logic. 
The display property asks for a ‘tightly equivalent’ consecution having the form 


AF X. 
The natural candidate is 
Akw B,C. 


That consecution is certainly equiprovable with the help of cut, but with equal cer- 
tainty is not interderivable by means of the cut-free rules. In particular, the negation 
rules are not reversible—on pain of losing the subformula property. 

The display property has technical point. In particular, the display property en- 
ables an ‘essentials-only’ proof of cut elimination relying on easily established and 
maximally general properties of structural and connective rules. Display logic is 
generally useful for becoming clearer on substructural logics just because—within 
wide and easily understood limits—proof of cut elimination for a display logic does 
not depend on which structural rules are postulated. In this way, display logics pro- 
vide the only known cut-free formulations of several rich relevance logics (ones 
including boolean connectives and both positive and negative relevance connec- 
tives) and of the brouwerische modal logic. Display logic formulations of S5 are 
the only cut-free formulations that don’t appear cooked up specially just to be cut- 
free. There have been display formulations of many other logics. For example, ear- 
lier proofs of cut elimination for linear logic relied on the associativity and commu- 
tativity of the underlying conjunction-like operation. The display-logic formula- 
tion demonstrates that this reliance is entirely unnecessary and beside the point: In 


?]n order to appreciate the property, one must be able to identify constituents (occurrences) as ‘the 
same’ in the course of transforming one consecution into another. Mere shape-alikeness is not to the 
point. For an exact statement of the property, see the account of the Display theorem in [4]. 
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spite of what one might gather from standard proofs, associativity and commutativ- 
ity have nothing whatever to do with the fact that linear logic has a cut-free formu- 
lation. In this and other cases, formulating a logic as a display logic, often in more 
than one way, has shed light in dark areas. Present space limitations require that 
for detailed illustrations we refer the reader to the following: [4, 5, 19, 12, 13, 14]. 
On the other hand, display logic has not assisted in tasks such as finding decision 
procedures. For a helpful appraisal, see [17]. 

The display property is not merely technical. It has a conceptual point. This 
point depends on an idea of Gentzen that we call ‘the sufficiency idea’: 

THE SUFFICIENCY IDEA 


One can explain the meaning of a connective, say —, by means of two 
sets of rules. (Let us stick to —, just as an example.) One set gives suf- 
ficient conditions for the role of A — B as conclusion or consequent, 
telling us by what A — B is implied. The other set gives sufficient 
conditions for the role of A — B as premiss or antecedent, telling us 
what A — B implies. Since these are the only inferential roles that 
A > B can have, the explanation is complete. 


The difficulty is that this account of the sufficiency idea is not an exact state- 
ment of what Gentzen himself does. In the first place, in Gentzen's consecution 
calculus for intuitionism, there is a familiar asymmetry between the grammar of 
where proof-theoretical premisses go—the antecedent X —and the grammar of where 
proof-theoretical conclusions go—the consequent Y . Briefly put, in X + Y, X can 
be a sequence of premisses, but Y must be a single conclusion. Gentzen's rules for 
A — B exhibit a companion asymmetry. The rules for A — B as conclusion are 
in fact in accord with the sufficiency idea as stated above, but the rules for A — B 
as premiss are not. It is necessary for Gentzen to state not merely how A — B be- 
haves as premiss standing alone. He also must and does give us rules for A — B 
as premiss in the context of other (conjunctively related) premisses. He must tell 
us not only what A — B implies, but what (A — B)-in-context implies. His rules 
are ‘contextual’. 

One might have supposed that when we pass to the symmetrical consecution cal- 
culus that Gentzen introduces for classical logic, the difficulty would be removed. 
Quite to the contrary. In Gentzen's system for classical logic, Y is no longer a sin- 
gle conclusion. The consequent, too, is a sequence. In this circumstance Gentzen 
must and does give us rules for A — B as conclusion in the context of other (dis- 
junctively related) conclusions. Now all his rules are contextual. 

The nub is this. If a rule for — only shows how A — B behaves in context, 
then that rule is not merely explaining the meaning of —. It is also and inextricably 
explaining the meaning of the context. Suppose we give sufficient conditions for 


A>B,XFY 
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in part by the rule 
XFA Bry 
A—B,XrY 


Then we are not explaining A — B alone. We are simultaneously involving the 
comma not just in our explicans (that would surely be all right), but in our expli- 
candum. We are explaining two things at once. There is no way around this. You 
do not have to take it as defect, but it is a fact. Itis as if you explained subtraction, 
‘b — a’, only in the context of addition, ‘(b — a) + x’, so that you explained two 
operators at once. If you are a ‘holist’, probably you will not care; but then there is 
not much about which holists much care. 

Proof-theorists tend not to be holists beyond necessity; proof-theorists prefer to 
articulate their proof theories so that they keep the elements as separate as possi- 
ble. That's where the display property comes in. If a proof-theoretical system has 
the display property, one is not forced to explain the properties of the connectives 
simultaneously with the properties of the context. You can first explain how the 
contextual elements work (structural rules). Then, in a separate explanatory act, 
you can explain the role of e.g. A — B standing alone as antecedent, and standing 
alone as consequent. Finally, you can observe that the display property now guar- 
antees that what you have done suffices to explain how A — B works in every 
context. In other words, it suffices to state with plain words what À — B implies, 
and by what it is implied, exactly in accordance with the sufficiency idea. This pro- 
cedure is certainly not 'essential; but it is good. 


2 THE DISPLAY SCHEMES A ANDP 


Two plans for securing the display property have been suggested. One comes from 
Wansing [19]; thinking of Amsterdam, we shall call it ‘the A scheme’. The other 
comes from [4]; thinking of Pittsburgh, we shall call it ‘the P scheme’. It appears 
that neither of these schemes is exactly a generalization of the other. We offer a per- 
spective from which one can see how one might arrive at each of these two schemes. 


2.1 The grammar of the two schemes A and P 
The grammar of the two schemes is largely the same. Here are the key ideas. 


e One key idea, due to [18], is that Gentzen's polyvalent comma shall be re- 
placed by a strictly binary structure-connective, which we write X o Y. This 
replacement, while not essential, enormously simplifies all of our thinking 
about structural contexts, and in particular our thinking about the display prop- 
erty. 

Furthermore, just as Gentzen used his comma as having one significance on 
the left and another on the right (systematic ambiguity), we shall do the same. 
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Just as in Gentzen, one thinks of o as signifying (something like) conjunction 
on the left, but (something like) disjunction on the right. These two structure 
connectives will be important for the matter of this paper. 


e Another key idea is that there shall be a structure connective permitting one 

to flip from one side of the turnstile to the other-and back again. We use x X .? 
One may think of * as representing (something like) negation. Accordingly 
if xX is an antecedent [consequent] constituent, then X is a consequent [an- 
tecedent] constituent. The star structure connective will figure heavily in this 
paper. 
So far work on display logic has assumed that * is not systematically am- 
biguous, but that assumption is not necessary. That is, freedom from context 
sensitivity is not essential for the fundamental properties of negative struc- 
turing that lead to the display property. That point is not, however, relevant 
to our present work. 


e A further idea is that there shall be zero-place structure connectives, I, that 
do the work that Gentzen did with the empty symbol. As in Gentzen, they 
will be context sensitive (sysematically ambiguous). They represent propo- 
sitional constants that are either truth-like or falsity-like, just as in Gentzen. 
The structure-connectives, I, are not part of the topic of this paper. 


e There is no good reason for proof theory to restrict structural ideas to those 
on someone else's list. Wansing [19] adds a powerful one-place structural 
idea eX that helps enormously in perspicuously representing various modal 
logics. Like I, this idea is not part of our immediate concerns. 


e Display logic in general permits the co-existence of many 'families' of struc- 
ture-connectives of the kinds just described. This key idea, first exploited— 
but without the display property-independently by Dunn and by Minc (see 
[3, $61.1]), is also not a concern here. 


In short, this paper is restricted to considering properties of the structural ele- 
ments X o Y and *X, although one may imagine the presence of other structure- 
connectives such as I or e or even structure-connectives from other families. We 
use X, Y , and Z to range over structures built from formulas by structure-connec- 
tives, and we reserve A and B for formulas. 


2.2 The A and P schemes defined 


We call the principles that secure the display property the ‘display equivalences’. 
There is no reason to suppose that only one set will do. In fact the A scheme of [19] 
employs different display equivalences than those of the P scheme of [4]. Neither 


?We follow [17] in using *«X' in place of the *X «' used in earlier papers. 


84 NUEL BELNAP 


scheme appears strictly more general than the other, nor is it plain exactly how they 
are related. What is offered here is a perspective for viewing the two schemes along 
side of each other. We begin by exhibiting the display equivalences postulated by 
each of the two schemes. 

THE P SCHEME 


The P scheme of [4] defines display equivalence as the smallest equiv- 
alence relation that makes equivalentall consecutions listed on the same 
line below. 


XoYtZ XF4#YoZ 
XFYoZ X oxY -Z Xr ZoY 
XrY YF «X **X-Y 

THE A SCHEME 


The A scheme of [19] defines display equivalence, in so far as it con- 
cerns o and x, as the smallest equivalence relation that makes equiva- 
lent all consecutions listed on the same line below. 


XoYFZ XE ZoxY Y-F*XoZ 
XFYoZ XosZFY *YoXtZ 
XHY *Y F «X Xr xxY 


2.3 Star in the two schemes 


The two schemes treat the negative structuring *.X in the same way. Each yields 

what amounts to full contraposition and double negation as structural principles. 

That is, consecutions on the same line below are display-equivalent each to each: 
STAR DISPLAY EQUIVALENCES 


XHY *Y -*X **X FY Xbt**Y 
X F «Y Y F «X 
aX FY *Y FX 
>, Oe we BAX... 


The last line is supposed to convey a general intersubstitutability property. This 
general property follows from the first line of equivalences only in the presence of 
other display equivalences-differentin the two schemes-sufficient to guarantee the 
display property. | 

There is no claim that these star display equivalences are required to be present 
in any sane scheme; it is just a fact that the A and P schemes share these princi- 
ples. We hold them fixed in our further comparison of the two schemes and in our 
wondering about additional schemes. 
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3 HOW THE A AND P SCHEMES DIFFER 


Where the A and P schemes differ is with respect to the binary structure connec- 
tives. This is not quite easy to see from the way in which each of the schemes was 
originally defined. We shall offer one good way of seeing the underlying principles. 
The idea is to envision each scheme as a different way of realizing the display prop- 
erty for the binary structure connectives. The remainder of the paper follows out 
this idea. 

Consider the display-equivalent consecutions 


XoYFHkF*Z Zk «(X oY) 


Obviously these two consecutions are equivalent solely by an agreed star princi- 
ple (a form of contraposition), without consideration of binary structure connec- 
tives. Furthermore, because of double-star (double-negation) principles that we 
have agreed to hold fixed, there is no loss of generality in affixing a star to Z. Doing 
so has the virtue that all of X, Y, and Z are antecedent constituents of the given 
consecutions, so that we don’t have to worry so much about redundant double entry 
bookkeeping in terms of antecedent and consequent constituents. Throughout the 
rest of the paper, we shall consistently exhibit consecutions in the fashion above, 
i.e. in 'contraposition-pairs' and with all of X , Y , and Z as antecedent constituents. 
It seems to help the eye. 

Evidently in the above consecutions it is Z that is in effect displayed (as an- 
tecedent), while X and Y are grouped together in a certain way. This is critical 
relative to the display property. There are three other pairs of consecutions that 
display Z in this sense, while simultaneously grouping X and Y in one way or an- 
other. Repeating the above pair, this gives us altogether four contraposition-pairs, 
which we think of as a ‘family’. We name it 'Z(XY V in order to remind us that 
Z is in effect set off by itself (displayed), whereas X and Y are grouped together. 

Z(XY) FAMILY 


(1) XoY - «Z Z F- *(X oY) 

2Q)YoXFHFsZ Z F- «(Y o X) 

(3) *(*X o*xY) F *Z ZF *X oY 

(4) *(*Y oxX)F *Z Zr *xYosX 
Then there are four more pairs in which X is displayed: 
X (Y Z) FAMILY 

(5) X F xY o«Z * («Y o«Z) - «X 

(6) X - *Z oxY * (*ZoxY)F- «X 

(7) X F «(Y o Z) YoZE«X 


(8) X - *#(ZoY) ZoY - «X 


86 NUEL BELNAP 


And lastly, there are four pairs in which Y is displayed: 


Y (X Z) FAMILY 
(9) Y - «X o«Z * (*X oZ) - «Y 
(10) Y - «Zos*X * (*ZoxX)t- xY 


(11) Yt «(X o Z) XoZk«y 
(12) Yk *«(Z o X) ZoXtixY 


So there are twelve pairs in all, four pairs in each of three families. Given that 
X, Y , and Z are all taken to be antecedent consitituents, these possibilities are ex- 
haustive. We summarize for easy reference. 

THE FAMILIES SUMMARIZED 


Z(XY ) family: (1), (2), (3), (4). 
X (Y Z) family: (5), (6), (7), (8). 
Y (X Z) family: (9), (10), (11), (12). 


THE DISPLAY PROBLEM 


The display property now sets as a problem that for each entry in each 
family, there must be at least one member of each of the other two fam- 
ilies to which it is display equivalent. 


Were it not so, the display property would fail. One may say that this problem- 
the ‘display problem -is solved in different ways by the A and P schemes. In order 
better to appreciate their differences, we note in passing that there is an ‘easy’ way 
to solve the display problem: just declare all of (1)-(12) as display-equivalent. We 
call this the ‘Easy scheme’: 

EASY SCHEME 


(1)-(12) are all taken as equivalent. 


The Easy scheme is strong. By making (1)-(4) equivalent, it renders the op- 
eration X o Y commutative wherever it occurs, and also identifies X o Y with 
*(*X o «Y ) wherever it occurs. The Easy scheme nevertheless does not trivialize 
display logic. For example, unlike all standard Gentzen consecution calculuses- 
even those for e.g. linear logic that rely on multisets or on a polyvalent comma-the 
Easy scheme does not imply any associativity-type principle such as an equivalence 
between W o (X o Y) F Z and (W o X) oY F Z. But aside from that, we really 
know very little about the Easy scheme. We shall be using it just as a foil. 

Let us turn now to the P and A schemes. Adjusting for relettering and double 
stars, the P scheme takes as fundamental the equivalence between (1), (5), and (6). 
It then ‘notices’ that this gives us an equivalence to (11), thus solving the display 
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problem. Following out this line, we can see that on the P scheme, the twelve con- 
secutions (or pairs of consecutions-we won't worry about this nicety) break down 
into three equivalence classes, where for each class, all the consecutions in the class 
are display-equivalent by the rules of the P scheme: 

THE P SCHEME AS SOLVING THE DISPLAY PROBLEM 


Equivalence class P}. (1), (5), (6), (11). 
Equivalence class P5. (2), (7), (9), (10). 
Equivalence class P3. (3), (4), (8), (12). 


Observe the following. 


l. 


You can see by inspection that each class P; contains a member from each 
of the three families. The P scheme therefore solves the display problem. In 
fact that is what we mean by 'solving the display problem'. 


. It is also easy to see that the equivalences in each class are just reletterings 


of the equivalences in each other class. For example, for the P scheme it 
suffices to postulate the equivalence of the members of Class Pj. 


. You can see that the P scheme is maximal relative to the Easy scheme: 


Adding any equivalence not already present generates the Easy scheme, i.e., 
the universal equivalence on (1)-(12). For example, making any consecution 
from class P, equivalent to any consecution from class P} would by transi- 
tivity make (1) equivalent to (2); which would then by relettering force the 
equivalence of (11) from P, with (12) from P5, and thereby make all of (1)- 
(12) equivalent. 


. You can also see that Class P, contains not one but two members of the 


X(Y Z) family; and analogously for the other two P; classes. In this 
respect the P scheme goes beyond the requirements set by the display prob- 
lem. That's interesting. 


The A scheme from [19] is different. Adjusting for relettering and double stars, 
the A scheme takes as fundamental the equivalence of (1), (6), and (9). By reletter- 
ing and double-star principles, we can see that on that scheme, there are two equiv- 
alence classes rather than three: 

THE A SCHEME AS SOLVING THE DISPLAY PROBLEM 


Equivalence class 41. (1), (4), (6), (7), (9), (12). 
Equivalence class Ag. (2), (3), (5), (8), (10), (11). 


Observe the following. 
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1. Evidently the A scheme contains at least one member from each family. The 
A scheme therefore solves the display problem. 


2. Also the equivalences in each class are just reletterings of the equivalences 
in the other class. For the A scheme, it suffices to take e.g. the equivalences 
of the class A; as postulates. 


3. You can see that the A scheme is maximal relative to the Easy scheme: Ad- 
ding any equivalence not already present obviously generates the universal 
equivalence on (1)-(12). 


4. You can also see that each class A; contains exactly two members from each 
of the three families. In this respect the A scheme goes beyond the require- 
ments set by the display problem—and in an elegantly symmetric fashion. 


It is easy to see that the A and P schemes are incomparable, neither being a re- 
finement of the other. For example, the P scheme identifies (1) and (5), whereas 
the A scheme does not; and the A scheme identifies (1) and (4), whereas the P 
scheme does not. Neither is, it would seem, more general than the other. Com- 
bining the schemes would force the equivalence of all twelve pairs—that is, would 
force the Easy scheme.* 

In one respect, however, the schemes are different. The P scheme relies on the 
identification of (5) and (6), (9) and (10), and (3) and (4). The A scheme, in con- 
trast, could, if it wished, treat its extra identifications, such as that between (1) and 
(4), as ‘accidental’. You can see the truth of this two-part observation in the fol- 
lowing way. 

Start with Equivalence class P}, and choose one of its entries from each family, 
say (1), (5), and (11). You need at least this much of P in order to solve the display 
problem (though you could have solved it by picking (6) instead of (5).) Now since 
(11) also has the form of (1) (relettered), you can easily calculate that we are forced 
by transitivity to add (6) to the equivalence class generated by {(1), (5), (11)}; no 
choice. There is no way to avoid the identification of (5) and (6), given that (1), 
(5), and (11) are put in the same class. 

The upshot is that there is no (proper) refinement of the P scheme that solves 
the display problem. Hence if, like the writer, one starts with the P scheme, one 
can wrongly be led to suppose that there is something ‘unique’ about the solution 
to the display problem that one has found. 

In contrast, however, it is easy to find a refinement of the A scheme, that is, a 
scheme that is more general (forces fewer provable equivalences) while still solving 
the display problem. This is what we shall do in the final section of this paper. 


4It is good to bear this in mind when examining how each scheme enriches itself by the addition of 
further structural postulates; otherwise, confusion is all too easy by which to come. 
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4 OTHER SOLUTIONS TO THE DISPLAY PROBLEM 


The observation that leads immediately to a refinement of the A scheme is this. If 
we start with (1), (7), and (12) as equivalent, no transitivities ever force us to add 
to the class. In the same way, the entire following ‘generalized A scheme'—we call 
it ‘the GA scheme’—can be justified: 

THE GA SCHEME 


Equivalence class G Aja. (1), (7), (12). 
Equivalence class GAijp. (4), (6), (9). 

Equivalence class G Aga. (2), (8), (11). 
Equivalence class G A2. (3), (5), (10). 


This scheme is minimal with respect to the display property: Each equivalence 
class contains exactly one member from each of the families. In contrast with the 
P and the A schemes, no ‘unnecessary’ identifications are made. One should not, 
however, infer too much: it remains true that the GA scheme is not a generaliza- 
tion of the P scheme; the GA scheme remains incomparable with the P scheme. 
As we indicated, the P scheme has no generalization (refinement) that solves the 
display problem. In other words, the G.A scheme is minimal among those solving 
the display problem, and so is the P scheme. 

Also striking is the following. Class G Ag, is a relettering of class G.A14, and 
Class G'Agp is a relettering of Class GA,,. But the equivalences of GA; leave 
those of GAjp entirely undetermined. Having postulated G A1a, you still need in- 
dependently to postulate G A1», if you want the display property. 

There is, however, one maximally elegant way in which one can have the A 
scheme without moving beyond the postulation of (only) the equivalences G.A;,. 
The idea is this.? In display logic X o Y is context sensitive, having one meaning 
or the other depending on whether it occurs in antecedent position or in consequent 
position. In the P scheme this feature is taken as an excuse to take the two ideas 
(X o Y in antecedent position, and X o Y in consequent position) as each prim- 
itive. But given the GA scheme, it also makes perfectly good sense to take only 
one of them, say X o Y in antecedent position, as primitive, while defining X o Y 
in consequent position. In particular, let us define as follows: 


DEFINITION 1 (A) X oY in consequent position shall by definition be 
*(*Y o*X). 


5 Conversations with Dunn on his work in algebraic logic directly prompted this observation. Much 
more to the point, it seems inevitable that the separate and joint algebraic work of Dunn and Allwein and 
Hartonas listed below in the references should turn out to be deeply relevant to the development of logics 
with the display property-and perhaps vice versa. Perhaps, for instance, the Dunn-Allwein-Hartonas 
research can supply an enlightening algebraic or relational semantics for display logic. Certainly the 
ideas of residuation and Galois connection, so fundamental to the algebraic investigations, appear “very 
like' the display property. Nothing, however, is definitely known. 
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Definition A appears circular. It is, however, not circular, because the operation 
o in its definiendum occurs in antecedent position. Thus, the definition instructs us 
to trade in every occurrence of o in consequent position for an occurrence of o in 
antecedent position. 

If we do this, we shall find that we obtain for free—without further postulation- 
the A-scheme equivalences that are between members of the same family, e.g., the 
A-scheme equivalence of (1) and (4), which are both in the Z(XY) family. We 
obtain this equivalence, for example, just by Definition 1. This seems an especially 
beautiful feature of the A scheme. It follows that the A scheme can reduce the 
number of connectives that it needs to think of as primitive. 

The GA scheme is a minimal solution to the display problem (but do not say ‘the 
minimal solution’ !). It is therefore good to ask for its other coarsenings. Evidently 
we obtain another if we replace Definition A by Definition A": 


DEFINITION 2 (A") X o Y in consequent position shall by definition be 
*(€X o «Y ). 


This gives us Scheme A": 
SCHEME A’ 


Equivalence Class A1. (1), (3), (5), (7), (10), (12). 
Equivalence Class A}. (2), (4), (6), (8), (9), (11). 


The A' scheme is properly named since it is evidently but a notational variant 
of the A scheme. In fact one could obtain the A’ scheme by starting with the A 
scheme, and then simply defining a new operation o' in consequent position by X o' 
Y =Y o X. Big deal. 

There is, however, another (technically) nontrivial coarsening of the GA 
scheme. The A scheme resulted from combining GA;4 with GAj,. The A’ 
scheme arose from combining G A14 with G Agp. One could instead decide to com- 
bine G Aj, and G A24. Then independently one could decide to combine G A1, and 
G A2y. We cannot guess, however, if these possibilities might lead to usefully dif- 
ferent ways of constructing display logics. In a similar vein, the relations between 
the P scheme and the A scheme remain unexplored, partly because of not being 
clear on the best questions to ask. Perhaps it would help to have a complete survey 
of all ways of solving the display problem. 


5 SUMMARY 


Because it is easy to lose track of where we are, we conclude with an epistemic 
summary. So as not to finish in professed ignorance, we put what we don't know 
before what we do. 


5.1 


l. 


5.2 
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Summary of what we don't know 


Metaquestion: What are good questions to ask about the relations between 
the P and the A schemes? Surely the two schemes don't just sit there. Are 
they good for different things, or can either do the work of the other, or ...? 


. We do not have a survey of the lattice of all ways of solving the display prob- 


lem. 


. Is the extra generality provided by the GA scheme of use in constructing 


some particular display logic? 


. Is there an application for the logic defined by the Easy scheme? 


. Nearly every logic combines a variety of formula-connectives. Display logic 


analogously invites combining different families of structure-connectives 
into a single logic. For example, the display version of relevance logic in [4] 
involves two families of structure-connectives, one for the relevance 
connectives and one for the boolean connectives. As a second example, some- 
what against the conventional wisdom, display logic permits intelligible com- 
bination of intuitionist logic and classical logic by allotting each a differ- 
ent family of structure-connectives. Heretofore, however, all of the actually- 
constructed examples using multiple families have postulated the same dis- 
play-equivalences for each family, and have been differentiated only via struc- 
tural rules that are not display equivalences. But mathematically and concep- 
tually there is no necessity to maintain the same display equivalences for all 
the combined families. Thus we may wonder if anything of interest emerges 
when one combines two families that are based on different schemes. For 
example, is there an interesting logic that combines a family based on the A 
scheme with a family based on the P scheme, all in the same display logic? 


. One hopes that there is an algebraic meaning to all of this, but if so its nature 


remains to be determined. It may well be available through the work of Dunn 
and Allwein and Hartonas as listed in the references. And vice versa. 


Summary of what we know 


. The A and P schemes each solve the display problem. 


. The A and P schemes are incomparable in the sense that each identifies con- 


secutions kept separate by the other. 


. The A and P schemes are each maximal in the sense that adding any new 


equivalence to either yields the Easy scheme (the universal equivalence a- 
mong (1)-(12)). Consequently they are unjoinable if one wishes not to make 
proof theory Too Easy. 
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4. The P scheme admits of no proper refinement that solves the display prob- 
lem. 


5. The A scheme admits of a proper refinement, the GA scheme, that (also) 
solves the display problem. 


6. One may obtain the A scheme from the GA scheme by definition. 


University of Pittsburgh, USA 
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MARCUS KRACHT 


POWER AND WEAKNESS OF THE MODAL 
DISPLAY CALCULUS 


1 INTRODUCTION 


The present paper explores applications of Display Logic as defined in [1] to modal 
logic. Acquaintance with that paper is presupposed, although we will give all nec- 
essary definitions. Display Logic is a rather elegant proof-theoretic system that 
was developed to explore in depth the possibility of total Gentzenization of various 
propositional logics. By Gentzenization I understand the strategy to replace con- 
nectives by structures. Gentzenization is something of an ingenious optical trick 
because it uses a single symbol to mean different things depending on the place 
it occupies in the sequent. In the original Gentzen system it was the comma that 
had to be interpreted as and when to the left of the turnstile and as or when to the 
right. The interpretation of the structures oscillates between two logical symbols 
depending on whether it is in the antecedent or in the consequent. This is why we 
call symbols like comma Gentzen toggles. These two symbols between which this 
toggle switches are the Gentzen duals of each other. So, and and or are Gentzen du- 
als. The strength of Display logic lies in a rather general cut-elimination theorem. 
In [10] and [9], Heinrich Wansing has refined these methods for modal logics; he 
showed that contrary to Belnap’s own Gentzenization of modal operators as binary 
structure operators, a unary one is more appropriate (not only from an esthetical 
point of view) and makes perfect sense semantically as well. The Gentzen dual of 
the modal operator O is actually not — as one might expect — the possibility operator 
©, but the backward looking possibility operator, denoted here by ©. (To be con- 
sistent with that we write G instead of O and © instead of ©.) The corresponding 
toggle is denoted by e. The reason why this is so natural lies in the fact that it is the 
exact Display or Gentzen dual, for we have that the sequent eB | A and the se- 
quent BF eA are equivalent if e is read as © if in the antecedent and if it is read as 
(3 if in the consequent. Wansing uses this fact to display various modal and tense 
logics à la Belnap by providing some formula introduction rules and basic struc- 
tural rules for K and Kt and then Gentzenizing the additional axioms. The benefit 
lies not only in the homogeneity with which all these systems are now handled and 
the rather clear intuitive background. The benefit lies in the possibility to use the 
general cut-elimination theorem of [1]. 
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During the summer of 1993 Rajeev Goré, Frank Wolter and myself have been 
intrigued by the possibility that Display Logic could be the key to rather simple 
decidability proofs via cut-elimination and some refined tricks of pushing around 
decidability. After successful proofs of the fact that all displayable logics are de- 
cidable — which we knew was wrong — and a subsequent investigation into the pos- 
sibility of having proved the inconsistency of arithmetic we found that one crucial 
lemma was flawed. The headache was soon to follow. Not only were the theorems 
on decidability false, it is actually undecidable whether a display calculus for modal 
logics is decidable. 


The negative results are now assembled in [5]. The present paper contains pretty 
much those parts of the original paper that have remained untouched by the disas- 
ter. I wish to thank Rajeev Goré for bringing my attention to Display Logic and 
his insistance that it is worth its while. Thanks for endlessly discussing this topic 
with me. Thanks also to Frank Wolter, Heinrich Wansing and Greg Restall for their 
criticism and to an anonymous referee for pointing out a number of mistakes in the 
first version. 


2 THEBASIC CALCULUS 


Below we outline the whole calculus for the basic tense logic Kt. This outline is a 
blend of [1], [10] and some own ideas. Kt is a special bimodal logic in which we 
have two pairs of modal operators, namely G and ©, as well as © together with 
©. The pairs look in opposite directions of the basic relation of the Kripke frame. 
We assume G and 9 to look with the relation denoted here by < and © and 9 to 
look in direction of its converse >=<]~. The display calculus works with a special 
set of extra operators used to Gentzenize logical symbols into structures. These 
operators are I, o, * and e. Given a set L of formulas we write Gtruc(L) for the 
algebra of structures over L. Gtruc(L) is actually nothing but the term algebra over 
L with the operators I, x, e and o. We distinguish formulas from structures from 
sequents. A sequent is of the form X - Y where both X and Y are structures. X is 
called the antecedent and Y the succedent. Succedents are called consequents in 
[1] and instances of rules are called consecutions. Formulas are denoted by upper 
case letters such as P, Q, R and structures by X,Y, Z. Sequents are pairs X - Y 
where X and Y are structures, and sequents are denoted by lower case letters such 
as s, t. We begin with the following fundamental logical axioms and rules: 


X- -P PHY 


(Id) pt p (Cut) yEy 
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and the following basic structural rules: 


XoYtFZ XoYFZ 
XFZoxyY YF*XoZ 
X F-YoZ XFYoZ 
X o*Z- Y xYoXEL-Z 
*X -Y X F «Y 
xY - X YE «xX 
ke X FY XE x*Y 
Xt*Y XEY 
Xb eY 
eX -Y 


Notice that contrary to the established notation we write * as a prefix, which makes 
formulae much easier to read. The following rules are now derivable: 


XFrFY 
xY eX 


These rules alone suffice to prove the following theorem, whose content gave rise 
to the name display logic. 


THEOREM 1 (Display Theorem) For every sequent s and every antecedent (suc- 
cedent) part X of s there is a sequent s' structurally equivalent to s such that X is 
the antecedent (succedent) of s'. 


The correctness of this theorem depends on a proper definition of the terms an- 
tecedent part and succedent part. 'This is defined here via positive occurrence in, 
where an occurrence is positive if it is nested by an even number of +. Namely, if 
P, Q are formulae then «P + *Q is a sequent but neither P nor Q can be displayed 
in their original position. However, we can easily derive Q | P, displaying both P 
and Q on the other side. So if s = V - W is a sequent and X occurred negatively 
in V, it will then occur positively in «V and so it can be displayed in «W F «V. 
The theorem is correct with the following definition as given in [1]. 


DEFINITION 2 In a sequent V + W an occurrence of X is an antecedent 
part if it occurs positively in the antecedent or negatively in the succedent. An 
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occurrence that is not an antecedent part is a succedent part. Equivalently, it 
is a succedent part if either it occurs positively in the succedent or negatively in the 
antecedent. 


It is actually possible to compute the sequent s' from s and the occurrence of X. 
Namely, we can consider s to be of the form V - W, where V, say, contains an 
occurrence of X. This occurrence may be either positive or negative. Then we can 
think of V as obtained from this occurrence of X by applying only unary functions, 
for example prefixing with e or ‘multiplying’ from the left with Y, that is, applying 
the function X +> Y o X, or multiplying with Y from the right. Each of these 
functions has a dual function; the function e : X + eX is it’s own dual, X + 
Y o X has the dual X > «Y o X, X +» X oY has the dual X + X o «Y. * has 
no dual in this sense, which makes this story a bit complicated. Without * we do 
the following. Let V — f(X), where f is a unary polynomial in the termalgebra 
of structures, and let f? be the dual of f. Then we transform f (X) W into X H 
f°(W). This transformation is reversible, so from X + f°(W) we can also derive 
f(X) + W. As noted, * creates a problem. To undo * we have to make antecedent 
and consequent swap sides, so we move from «X | W to «W H X. With this 
proviso, it is justifiable to say that « is self-dual. It is now clear why the Display 
Theorem has this peculiar restriction concerning the part-of relation, because for X 
to be displayed on the same side as it occurred in the original sequent we need to 
perform this swapping an even number of times. The Display Theorem is brought 
here into the following form, writing Pol; (Struc) for the set of unary structure 
polynomials. 


THEOREM 3 Let f € Pol, (Struc) which embeds an occurrence of its argument 
an even number of times with x. Then there exists a f? € Pol, (Struc) such that: 


fX)rw 
X F f*(W) 


displaying that particular occurrence of X. Moreover, for f = g^, f^ = gisan 
appropriate choice. 


Let us note that the calculus has an inbuilt symmetry or self duality in the following 
sense. Define the dual (X + Y )^ of a sequent by (X H Y)^ =Y F X. The dual 
II^ of a proof II consists in the dualization of all sequents. 


THEOREM 4 (Symmetry) For every proof Tl of X + Y, II^ is a proofofY + X. 
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Now we define the following equivalence between structures. X and Y are called 
similar, in symbols X ~ Y, if for every Z the following two conditions hold: 


XFZ ZEX 
YFZ Z-Y 


Similarity thus means that X and Y are interchangeable in a proof both as ante- 
cedent and succedent modulo some reversible rules. Replacement of Y for X in 
a given proof does not necessarily yield another proof but it can be made into one 
by adding some extra steps. The following theorem is a direct consequence of the 
Display Theorem. 


THEOREM 5 z is a congruence on Gttuc(L). 


Proof. Clearly, z: is an equivalence relation. Moreover, as in our calculi we have 
full substitutivity, the only thing to be checked is whether ~ satisfies the genuine 
congruence property that if |] is an n-ary function symbol and X; + Yi,..., X4 Z 
Y, then also (X, . .., X4) © (Yi, . .., Yn). It can easily be seen that it is enough 
to show this for unary polynomials. Let f be such a polynomial, and consider an 
occurrence of its argument X . Let g be its dual with respect to this occurrence. Two 
cases have to be distinguished, namely whether or not f(X) embeds X an even 
number of times. Let the number be odd. Then we deduce from f(X) | Z that 
g(Z) - X and then g(Z) + Y, by assumption that X % Y, and then f(Y) + Z. 
If f embeds X an even number of times, we deduce X | g(Z) instead, and then 
Y F g(Z) and finally f(Y) + Z. Similarly for X and Y in the succedent. H 


3 THE MODAL DISPLAY CALCULUS 


Let us now define the full calculus for the basic modal logic K and the basic tense 
logic Kt. It will be called DLM. In addition to the rules of the base calculus it 
has rules to introduce connectives and some more structural rules. The operational 
rules are the following: 


(F T) IFT (T F) ira 
TFX 
XFHI 
(= L) ——— (Lr) LHI 
XFL 
XHFrFxP | *xPrx 
(a) SH) -——— 


XFnxAP aPHrX 
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(F A) 


(F V) 


(==) 


(FG) 


(- 9) 


(© F) 


(E F) 
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XFP YF 
AFP YFQ (A F) 
XoYFPAQ 
XHFP 
XrFPoQ (V -) 
XFPVQ 
XoPF 
XF-P>Q 
X-FP 
wars (G H) 
Xr QGP 
XFrFP 
— (o F) 
*xex*X - OP 
PtreX 
Ce (F ©) 
PHX 
PHX 
— (F 5) 
EPrs*esX 


PoQHF X 


PAQFX 


PHX QFY 


PVQEXoY 
XFP QFY 


Po QkE4«xoY 


PFX 


GPF èX 
xexPr-X 


oPrX 
Xr P 


eX + OP 
XbFx*exP 


Xr HP 


The calculus for K will be obtained by deleting the introduction rules for the con- 
nectives E] and €. Notice the complete duality between © and © as well as between 
and G. We can formalize it by extending the duality map as follows: 


p^ = p 
(PAQ) = P^vQ^ 
(BP)^ = e(P^) 
(EP) = e(P^) 
I^ = I 
(XoY)^ = X^oy^ 


(^P) = P^) 
(PVR) = P^AQ^ 
(©P)4 = m(P^) 
(OP)4 = «8(P4) 
(«X)^ «(X^) 
(°X) = e(X^) 


THEOREM 6 (Duality) For every proof II of X + Y, II^ isaproofofY ^ + X^. 


Of course, this time we can only speak of duality, not of symmetry. Finally, the 
following structural rules are added. (If only a part of these rules is added, we have 
a substructural calculus. Much of what will be proved here applies to substructural 
logics as well; in a sense, the full calculus is the most difficult case, and this is the 
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reason why we concentrate on this calculus.) 


XEZ XEZ 
IoXt-Z X F- IoZ 
I-Y XFI 
xI-Y X F xI 
XFZ XFrZ 
(Wl) ——— (Wr) ———— 
YoXFZ XoYFZ 
Xi10o(X20 X3) - Z Zł- X1o(X20X 
(Al) 1 ( 2 3) (Ar) 1 ( 2 3) 
(X10 X3)o X3 - Z Z F (X10 Xe) o X3 
XoYF-Z ZEX oY 
(Pl) — ——— (Pr) ——— — — 
YoxXtZ ZEYoXx 
XoXFZ ZEXoX 
(cl) ——— (Cr) —— —— 
XrZ Zr X 
IFY X FI 
(Ml) ——— (Mr) ———— 
eIt Y Xt eI 


This concludes the definition of DLM. There are a number of things which have to 
be explained. First of all, some rules originally proposed in [10] have been left out 
because they are derivable. On the other hand, a lot of structural rules have been 
added to the calculus, mainly the duals of existing rules. The one-sided rules are 
sufficient, but this has the effect of disturbing the duality on the level of proofs (not 
for provability, since the two-sided rules are derivable from the one-sided rules). 
The necessitation rules have been changed; they are now fully structural and do 
not require the use of formula variables. Furthermore, there are now two more ne- 
cessitation rules derivable with the help of the newly introduced rules (Q), namely 


I-Y X FI 
(DI) ———— (Dr) —————— 
xexsI-Y Xk xexI 
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For example, the first is derived as follows: 


XI 
ETTUE 
xI *X 
IF «X 
elt «X 
I-e«X 
xIF ox X 
ox Tb *X 
Xt *exI 


Define the following translation for sequents: 


T(X - Y) 2 n(X) 2 n(Y) 


n(P) = P T2(P) = P 

7 (I) = T T2(I) = 1 

ni (*X) = n(x) T2(*X) = n(x) 
m(XoY) = m(X)^m(Y) T((XoY) = H(X)VWR(Y) 
Ti (eX) = n(x) T2 (eX) = B(X) 


With respect to this translation we have the following theorem. 


LEMMA 7 Let X + Y bea sequent. From X + Y the sequent I+ r1(X) > 
T2(Y ) is derivable in DL M. 


Proof. We prove that if X + Y is derivable, then so is r3 (.X) F r2(Y ). It is not 
difficult to put the proof to a conclusion from there. The remaining proof is an in- 
duction on the complexity of the sequent. In fact, we will again prove something 
stronger, namely that we can derive any V H W which arises from X F Y by re- 
placing any given substructure by its proper translation (i. e. by its 7; -translation or 
by its 7)-translation depending on whether it occurs as antecedent part or as succe- 
dent part). We will not go through all the cases. But take the case of a sequent 
X o Y F Z and assume that the claim has been verified for X and Y. We wish to 
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show it for X o Y. Here is a proof. 


XoYF-Z 
XFZoxY 
n(X)FZoxY 
Y - «n(X)oZ 
nr (Y)F *n(X)oZ 
n(X)on(Y)-Z 
M(X)AR(Y) FZ 
n(XoY)FZ 


Another interesting case is eX F Z. 


eX -Z 
X F eZ 
n(X)beZ 
on(X)FZ 
n(eX)- Z 


It is easy to supply the remaining cases. a 


THEOREM 8 A sequent X + Y is DLM-provable iff r(X - Y) is a theorem of 
the tense logic Kt. 


Proof. The correctness of the display calculus is a matter of straightforward ver- 
ification. Notice that the display calculus contains modulo translation only rules 
which are derived rules of Kt, such as (M N+) P/GP, and (M N^) P/BEP, which 
are consequences of the rules (MI) and (Mr) (modulo some other rules). The com- 
pleteness is somewhat tricky. Consider the set O of all formulae P such that I - P 
is derivable in DLM. Moreover, consider the set ©, of all r( X + Y) which are 
derivable in DMLE, defined below. By Lemma 9 below we know that a sequent 
is derivable in DLME iff it is derivable in DLM, and that sequents with identical 
T-translation are interderivable. We conclude that © = OL. Now it remains to 
be seen that O is a normal modal logic, which will establish the completeness. If 
P € O, then also GP € O and OP € O, as can easily be shown. Next, if P € O 
and P — Q € O, then not only I - P — Q is derivable, but also P - Q. By an 
application of (Cut) we get I + Q, showing Q € ©. It still needs to be shown that 
all boolean tautologies can be proved, but we refer here to [10]. a 


102 


MARCUS KRACHT 


Here are the additional rules for the calculus DLME. 


(oH) PAQFY (F o) XFPVQ 
PoQFY XFr PoQ 
aPFY Y FaP 

(+ F) Pry (x) —————— 
*PHY Y F«P 
THY AFL 

(IF) — — ("D —— — 
I-Y X FI 
PHY X FOP 

(eH) ———— (Fe) ———— 
ePr-Y X Fr eP 

weet) SPE quu) XE 
xexPH Y Xk xexQ 


LEMMA 9 All rules of DIL. ME are admissible in DL M. Moreover, if X1 - Yi 
and X» t+ Yo have identical r-translation, then they are interderivable in DLM E. 


Proof. The proof is by showing that the highest application in a proof tree of such 
a rule can be eliminated without adding new instances of such rules. To make life 
simple, we will assume that DLM admits cut-elimination. (This will be proved in 
the next section. This assumption is not strictly speaking necessary, but it simplifies 
the argument.) Thus we can assume our proofs to contain no cuts. Now take a 
highest instance of an elimination rule, say of (o ). It's premiss is of the form 
PAQ F Y. Now trace the occurrences of P A Q backwards. Each occurrence 
below the line has one (in the case of contraction two) counterparts above the line, 
unless, of course, P ^ Q is principal (see next section for a definition). This can 
only be in an application of (A +). Replacing the traced occurrences of P A Q by 
PoQ will transform valid instances of rules in valid ones, with the exception when 
P ^ Q is principal. Then replacing P ^ Q below by P o Q will result in a trivial 
deduction, which can be omitted. a 


4 CUT-ELIMINATION AND THE SUBFORMULA PROPERTY 


[1] lists eight conditions on a proper display logic called (C1) — (C8) and proves 
the following: 


THEOREM 10 Any display calculus satisfying (C1) has the subformula property, 
that is, any cut-free proof of the sequent X - Y contains only structures over sub- 
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formulas of formulas in X and Y. 
THEOREM 11 Jn any display calculus satisfying (C2) — (C8) (Cut) is eliminable. 


Now what are these conditions (C1) — (C8)? We will give the original conditions 
in some slightly less general form. The difference is that we have stated a rule of 
simultaneous substitution (C'6/7), which is the appropriate choice to make in this 
context. Belnap assumes that in each rule we first stipulate a set of constituents 
and an equivalence relation on parameters called congruence. Here, parameter is 
an occurrence of a structure in a rule. 


(C1) Each formula which is a constituent of some premiss of a rule p is a subfor- 
mula of some formula in the conclusion of p. 


(C2) Congruent parameters are occurrences of the same structure. 


(C3) Each parameter is congruent to at most one constituent in the conclusion. 
Equivalently, no two constituents of the conclusion are congruent to each 
other. 


(C4) Congruent parameters are either all antecedent or all succedent parts of their 
respective sequent. 


(C5) If a formula is nonparametric in the conclusion of a rule p it is either the en- 
tire antecedent or the entire succedent. Such a formula is called principal 
formula of p. 


(C6/7) Each rule is closed under simultaneous substitution of arbitrary structures 
for congruent parameters. 


(C8) If there are inference rules p and o with respective conclusions X + P and 
P+ Y with P principal in both inferences (in the sense of C5) and if (Cut) 
is applied to yield X + Y then either X | Y is identical to either X F- P or 
P+ Y; or it is possible to pass from the premisses of p and o to X F Y by 
means of inferences falling under (Cut) where the cut-formula always is a 
proper subformula of P. 


These conditions actually need some exegesis. First of all, the present formulation 
assumes that the so-called analysis is performed at the actual proofs, not at the rules 
as presented above. To see the difference, let us call a rule a consecution, and a 
rule skeleton a consecution formulated with the help of structure variables. A rule 
skeleton can be instantiated to a rule by substituting structures for structure vari- 
ables. Notice that rules still contain variables, but only for formulae. We assume 
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that any substitution instance of structures for structure variables and of formulas 
for formula variables is an instance of the rule. Rather than applying the analysis 
to the instances of the rules (which have no structure variables!), we perform the 
anaysis on the rule skeleta of the calculus DLM as presented above. Hence, for 
our purposes, a parameter may be thought of as an occurrence of a structure vari- 
able in a rule skeleton. In the actual Display Calculus, however, they are merely 
metavariables standing in for arbitary occurrences of structures. This is the way 
we will handle the Display Logic later as well. But let us suppose for the moment 
that they are genuine variables of the system. Then formula variables can never be 
parameters. Furthermore, if a structure X occurs parametric once, then all occur- 
rences of X are parametric. Under our interpretation, then, (C2) is trivially sat- 
isfied. Moreover, (C 6/7) is satisfied as well by force of our definition. Only the 
remaining conditions are ever to be checked in DLM. In the associativity law (A), 
for example, all variables are parameters, and all occurrences of the same variable 
are congruent. In a formula introduction rules such as (F ^) the new formula in 
the conclusion as well as the old ones in the premiss are typically not parameters. 


With the exception of (C8) the conditions are verified by direct inspection. We 
leave it to the reader to verify (C1), (C3), (C4) and (C5). For (C8), [10] gives a 
proof that (C8) holds for OQ and GQ as cut-formulas. The case of E1Q and ©Q 
is completely dual, i. e., obtained by swapping antecedent with succedent, so we 
might actually skip the proof here, but for the sake of completeness we give the 
corresponding proof of (C8) in these cases. So let us first suppose that the cut- 
formula is OQ. Then we have 


XFQ Qk «Y 
eX - OQ oQrFY 
eX -Y 


The following, however, is a proof involving a cut on Q rather than 6Q. 


XFrQ Qk eY 
Xk eY 
eX FY 


Suppose next that we have a proof involving E1Q as a cut-formula. 


X | *exQ QFY 
XFAQ HQ F xexY 
X F *exY 
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The following is a proof with a cut on Q. 


X F xexQ 
xex X F Q QrY 
xexX | Y 
xY | ex X 
+Y F *X 
X F xexY 


5 PROPERLY DISPLAYING EXTENSIONS OF KT 


The usefulness of Display Logic shall be demonstrated with a theorem which 
shows that a large class of logics have a canonical proof system in Display Logic. 
We will analyse exactly which rules 6 = P4... P/Q can be incorporated into 
DLM by just adding another structural rule that does not destroy the properties 
(C1) — (C8). Such a rule captures a rule via its 7-translation in modal logic as fol- 
lows. Let us be given a logic A = Kt + A that is, an extension of (Hilbert-style) 
tense logic by a set of rules A, and assume that we add to DLM a set R of struc- 
tural rules. Then we say that DLM + R properly displays Kt + A if (C1) — 
(C8) are satisfied and every derived rule of Kt + A is the 7-translation of a de- 
rived rule of DLM + R. The latter condition can be rephrased as requiring that 
a rule transition is derivable in DLME + R iff its 7-translation is a derived rule 
of Kt + A. For, by the fact that the added structural rules preserve the properties 
(C1) — (C8) we have cut-elimination, and an analogue of Lemma 9 holds. Then 
in DLME + R, 7-equivalent sequents are interderivable, while in DLM + R we 
can only go from sequents to sequents if we do not eliminate connectives. Never- 
theless, it is easy to see that if a rule is DLME + R-derivable, then there is another 
rule with identical 7-translation which is derivable in DLM + R. In this section we 
will give a complete characterization of properly displayable Kt-calculi. We show 
first that the contribution of a rule p = $152... Sm/t can be directly computed as 


p=7(81)..-T(8m)/T(t). 


LEMMA 12 Suppose that p is a structural rule, and let DLM + p satisfy (C1) — 
(C8). Then DLM + p properly displays Kt + j. 
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Proof. Two things need to be seen. First, any extension by structural rules satisfy- 
ing the display conditions axiomatizes a normal logic. And second, that it axiom- 
atizes the logic as given. Let © be the set of all r( X + Y) which are derivable in 
DLME + p. Since the rules of the calculus turn into axioms of Kt + p, we have 
proved now that we have displayed a logic at most as strong as Kt+ 5. On the other 
hand, suppose that for some substitution c we have proved P7? ,..., P2. Then there 
is a substitution w of formulae to structure variables such that P? = T(s;"). Then 
if we can derive r(s;^) we can also derive T(t”) = Q°. Thus f is a derived rule 
of the calculus DLM + p translated under 7. E 


For example, adding the rule 
Tk eX 
I-X 


This rule translates into the rule G P/P, a rule which is actually admissible in Kt, 
but not derivable. In the calculus obtained by 7-translation it will however be a 
derived rule. 


Now, modal logics are generally studied as axiomatic strengthenings of a basic 
system rather than a strengthening by proper rules. Therefore, let us concentrate 
on the question of displayability of axiomatic extensions. First we will consider 
which axioms admit a resolution into a structural rule. Denote by Kt ® 6 the least 
normal logic extending Kt which also contains 6. 6 can always be written in the 
form A — B, where A and B are free of —. We can then pass from the axiom 
A > B to a sequent rule 


Bry 
AFY 


This sequent rule is as powerful as the axiom. For putting Y = p, a variable not 
contained in A or B the 7-translation is the rule 7, (B) — p/7(A) > p. Now let 
p = (B); then the premiss of 6 becomes a theorem and we see that 7, (4) — 
7(B) is an axiom of the calculus axiomatized by this rule. The axiom allows to 
derive the rule, however, and so the two are equal in power. Assume now that both 
A and B are composed from T and propositional variables using only A, V, © and 
©. Then, by standard equivalences, 


A. 8. V Ci B. G.M D; 


im j<n 


where all C;, D; are composed from variables and T with the help of A, © and © 
only. (Thus, disjunction has been eliminated.) Instead of the rule above we then 
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equip our calculus with the rules p1,..., Pm where each p; is of the form 


D,FY,...,Dn FY 
Ci- Y 


Again it is checked that this new calculus is equivalent in power with our logic. 
Finally, define the translation o from formulae into structures via 


o(T) = I 

o(p) = p 

c(P^Q) = a(P)oa(Q) 
c(oP) = eo(P) 
o(OP) = xexc(P) 


It is checked by induction that 7; (c(P)) = P for every formula P made from 
variables, T, conjunction and possibility operators. Thus we have eliminated V. 
Finally, then, let us replace the above rules p; by their Gentzenized sisters. 


c(Di) -Y,...,c(D4,)- Y 
c (Ci) FY 


So, for axioms of the form A — B, where A and B are positive and free of (3 and 
E] we have managed to write a display system that completely axiomatizes it. For 
the rules above (C1) is obviously satisfied since we are strengthening the system 
for Kt by structural rules. (C4) and (C5) are verified by the eye. The condition 
(C8) has to be checked only with respect to the rules which are introducing for- 
mulae, and this has been done already. We are left with (C3). (C3) is actually not 
automatically valid. In fact, we must place the restriction on the formula A that it 
may contain each variable only once. (Maybe only those which already occur in 
B, but the others can be eliminated.) Let us now agree to call a formula primitive 
if it is of the form A — B where both A and B contain only variables, T, ^, V, © 
and © and that A contains each propositional variable at most once. 


LEMMA 13 Suppose A is a tense logic axiomatizable by primitive axioms. Then 
A can be properly displayed. 


Proof. A can be displayed in the way described above; the display calculus meets 
(C1) - (C8) and therefore enjoys cut-elimination and the subformula property. 
The system is complete for A in the sense that it derives X + Y iff mı (X) Fa 
T9 (Y). HN 
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Now let us tackle the question of what logics are defined by a display calculus. 
We will show that it is actually the same class of logics if we insist that additional 
rules are completely structural. To see this consider a rule 


Xi F Yy,..., Xn F Yn 
V-W 


where all X;, Y;, W contain only structure variables. By (C4), congruent formu- 
lae do not swap sides, and so they are either in the antecedent throughout or in the 
conclusion throughout. Now rewrite this rule into the following form. 


IF xX1oY;,...,I- «X, oY, 
I-«VoW 


* 


p 


p and p* are equivalent and p* satisfies (C4) as well as (C3) if p does. The other 
conditions are harmless. This rule translates into the axiom 


To(*X1 o Yi), ..., T2 (X4 o Yo) /To («V o W) 


so that we now have to worry about which formula can occur as a translation of a 
structure under 72. Dually, we can solve the question of which formulae are trans- 
lations under 71. 


LEMMA 14 P = ™(X) for some X containing no formula connectives exactly 
if P d- Q for some Q built from variables and negated variables, T, ^ and the 
diamonds © and ©. 


Proof. We know that if X ~ Y then 7,(X) 1- r(Y) so that without loss of 
generality we can assume that X is in normal form (see Section 9) 


X —pi,9...0pi, 0*pj, o...o*pi, oeX;o...oeX,, oxeYi o... o xeY, 


Then 
n (X) = A Di, A A Pj, ^ A 71 (Xz) A A O71 (*Yu) 


r<k s<l t<m u&cn 


Now do induction on the number of nested occurrences of e. BE 


There remains now only the problem of negated occurrences of variables to prove 
the characterization. Therefore consider a variable p that occurs negated in 7; (V o 
xW). Then replace p in the axiom throughout by ^p. As it occurs negated through- 
out (by (C4) and the definition of part of), after substitution it occurs doubly ne- 
gated throughout and so we can eliminate the negation altogether for p. Similarly 
for variables occurring only in the premisses. Thus we have proved the following 
characterization. 
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THEOREM 15 An extension of Kt by rules can be properly displayed iff each rule 
P, ... Pm F Q is such that all occurring formulae are built from L and variables 
with the help of ^, V and G, Ð. 


We can obtain as a special corollary a characterization of properly displayable ax- 
iomatic extensions. By the condition of completeness of DLME + 6 with respect 
to the logic Kt @ ô itis possible to display Kt @ ô by rules of the form 


Xi1F Y,...,X.- Y 
VEY 


Call these rules special. They are characterized by the fact that one variable is 
shared as the common succedent or antecedent. For on the one hand, putting Y = 
p, a fresh variable, the rule corresponding to p is 


p n(Xi) —Dp,... ,m(X«) — p/ni(V) >p 


Putting p = V; 71(X;) we derive the axiom rj (V) — V; r1(Xi). This is a prim- 
itive formula. This formula if taken as an axiom is as least as strong as the rule 
p. So, by the completeness of DLM + p for Kt + 6 we get the completeness for 
Kt @ 71(V) ^ V; (X). On the other hand, any axiom can be brought into the 
form A — B, and hence is characterized by a special rule. This special rule must 
be derivable in DLME. By the condition that A and B are translations of structures, 
we ultimately arrive as before at the requirement that A — B is primitive. But then 
it can be properly displayed, as we have seen. 


THEOREM 16 (Proper Display I) An axiomatic extension of Hilbert-style tense 
logic can be properly displayed (by structural rules over DL M) iff it is axiomati- 
zable by a set of primitive axioms. 


6 SEMANTIC CHARACTERIZATION 


It is possible to characterize exactly the semantic conditions that can be associated 
with primitive axioms. It is known from correspondence theory ([7] and [8]) that 
primitive formulae are canonical and therefore complete; in addition the condition 
that the formula places on the canonical and Kripke frames is elementary. Yet it 
is also important to know what condition a particular axiom expresses. [8] devel- 
oped the method of substitutions to find the elementary equivalent of a Sahlqvist 
formula but we find this method not so user-friendly. Instead we use the technique 
of decisive sets as proposed in [4]. Some results have also been established in the 
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somewhat simpler [3]. This method rests on the fact that the condition expressed by 
an axiom can be squeezed out by very special valuations. These valuations decide 
what elementary condition this axiom determines in the sense that to know what the 
condition is we just have to check these valuations. Equivalently, to test a Kripke 
frame for whether it accepts a given Sahlqvist formula we only have to use these 
decisive sets as values of the propositions (or, even better, this can be fine tuned 
to decisive valuations rather than decisive sets). On Kripke frames the question is 
then settled. On generalized frames we have two choices: either these valuations 
exist — and then the elementary condition is forced on the underlying Kripke frame 
by these general arguments — or the sets are the infinite intersection of admissible 
valuations. In this case if the frame is compact as a topological space it guarantees 
that a family of valuations converging to a decisive valuation has the same effect 
as the decisive valuation itself. In this case we can simply pretend that the decisive 
sets are internal in the generalized frame. (Or, equivalently, that they can be added 
without disturbing the sets of Sahlqvist formulae accepted by the frame.) 


It turns out that for the axioms in question the singleton sets {w},w € f, are 
decisive. This is most welcome in practical computation, because then we can do 
what every student of modal logic (including the author) is at one stage always 
tempted to do; namely, to pretend that a variable stands for being at a certain world. 
For example, in the axiom Op A Oq. — .O(p ^ q) let us pretend p means being 
at £p and q means being at x,. Then this axiom tells us that if we can see ©, and 
£q then £p and x, must be equal. Hence, it tells us that any point has at most one 
successor. According to [4], the elementary properties defined by Sahlqvist tense 
formulae can be described as follows. They are of the form (Vx) where ® is com- 
posed from positive formulae z «1 y, x — y with the help of ^, V and the so-called 
two way restricted quantifiers (dy > x), (3y < x), (Vy > x), (Vy < x) in such a 
way that in a subformula x < y, x = y at least one of z and y is hereditarily uni- 
versal, which means that it is not inside an existential quantifier. Modal Sahlqvist 
formulae differ only with respect to the quantifiers (dy «1 x) and (Vy «1 x), which 
may not be used. Let us call ó = (Vz)x a primitive formula (modal or tense) if it 
is of the described form and no universal quantifier is in the scope of an existential 
quantifier; hence it has the form (V)(3) (with the appropriate restricted quanti- 
fiers) where V is positive and in an atomic subformula z < y or x = y at least one 
of x and y is hereditarily universal. 


THEOREM 17 Suppose that § is a class of modal or tense Kripke-frames described 
by some finite set of primitive sentences. Then the modal logic of § can be properly 
displayed. 


Proof. It suffices to derive in the calculus of first-order equivalents as described in 
[4] that a negative, bounded 3V-sentence is equivalent to a sequence N &*B where 
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Jt is negative and free of G, EJ and 5f is positive and free of ©, ©. E 


This seems surprising, because well-known systems such as S5, Alt; etc. are often 
axiomatized using O (= O). But this need not be so. The axiom of transitivity can 
be rendered as OO p — Op, the axiom of reflexivity as p — Op, and the axiom of 
symmetry as p ^ Oq — O(q A Op). The quasi-functionality can be axiomatized 
by Op ^ Oq > O(p^ q) rather than Op > Op. 


COROLLARY 18 All elementary subframe logics can be properly displayed. 


Proof. If A is elementary, say, described by 4, then ó is a restricted universal sen- 
tence. Moreover, by a result in [4] if à is universal and it’s class is closed under 
p-morphisms, then ¢ is equivalent to a universal and positive sentence. ! Thus 
elementary subframe logics fall under the class just mentioned in the theorem above. 

E 


COROLLARY 19 All r-persistent subframe logics can be properly displayed. 


Proof. R-persistent logics are elementary, due to a result by [2]. a 


7 NICE RULES AND MODAL DISPLAYABILITY 


This section discusses the possibility of writing nice rules for modal axioms. Two 
things will be shown; first, that the extra strength added by the fact that we have 
the tense dual of the modal operator allows to state rules quite concisely in some 
cases. And second, that the extra strength does not allow to axiomatize more log- 
ics. We will demonstrate the first point with a particular example. Suppose, we are 
interested in a display rule for .3. 


3: OpANSg. > .O(pA Yq) V O(GA Sp) V O(pAq) 


This axiom is already in rather perspicuous form, it is primitive and we can translate 
it directly into a display rule. 


x*e*(X oxexY)- Z xex(X oY) Z x0x(Y 0 x0tX) b+ Z 
xex X oxexY F Z 


1 The proof there is actually highly incomplete, as I recently found out. A full proof can be found in 
[6]. 
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Semantically, it has the following content. 


(Vw)(Vz > w)(Yy > w)r-yVyazvzray) 


If we are sitting at the root of the tree (marked by a star), thus seeing two points, 
then these points are either identical or one of them sees the other. 


In standard modal logic, one can do no better. However, although it is modal 
logic we are doing, the display calculus uses a Gentzen toggle and so we have im- 
plicitly the power of tense logic in the display calculus. The Gentzen toggle is there 
anyway, so we can use the extra power it gives us by rewriting the axiom .3 accord- 
ing to the possibilities of tense logic. Notice, namely, that in tense logic we can 
generate the tree in the picture from any point we wish because we can look both 
ways. Consequently, we can shift the reference point of the axiom from the root of 
the tree to one of the branches. 


Read from there, the semantic characterization is as follows 
(Vw)(Vz < w)(Vy > z)(y-wVvyawVvypb w) 
Put into a tense formula it looks like this 
p. > pVOopV Op 


This axiom uses far less symbols and only one variable. In modal logic, there is no 
way to axiomatize 84.3 over S4 with the help of just one variable, even though there 
are different ways of writing .3 (but effectively only one primitive way). Hence we 
can axiomatize .3 in DLM with the following rule 


XFY eXtFY xexXF- Y 
e*kexXL--Y 
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Now for the second claim that we cannot axiomatize more logics. The Proper 
Display Theorem I states that a tense logic is properly displayable iff it can be ax- 
iomatized by a set of primitive formulae. This can be strengthened to modal logics 
using some model theoretic techniques. Let us begin by introducing generalized 
restricted quantifiers. We put 


(Vy Do x)ó(z, y) 
(Ay >o x)é(z. y) 
(Vy P1 x)ó(x. v) 
(Ay P541 x)ó(z, y) 


p(z, x) 
(x, x) 
p(x, x) A (Yz b z)(Vy >n z)ó(z, y) 
p(z, v) V (3z > z)(3y Pn z)ó(x, v) 


I 


These quantifiers quantify over sets of points which can be reached within a fixed 
number of steps. Recall that a modal formula is called restricted if it is built from 
atomic formulae with the help of the quantifiers (dy > x), and (Vy > zx). It is 
equivalent to require that the generalized restricted quantifiers be used. Our aim 
is to show that if o is primitive and characterizes a modal class of frames then 
o is equivalent to a primitive modal formula. The problem here is to get rid of 
the quantifiers (Vy < x) and (dy < zx). In order to do this, we take a detour. 
Clearly, a can be written in the form (VZ) (Sy) 0(Z, y), with 9(z, y) quantifier free, 
using unrestricted quantifiers. To simplify the notation, we use a single variable 
x instead of z and likewise a single variable y for y. Define now the formulae 
ók(x) = (Ay bx x)ó(z, y). Then 


óo(z) F à1(z) F d(x) F ... F (3y)ó(o. y) 


It is therefore enough if we show that there is a k such that (Sy) (a, y) F p(T). 
Suppose that this is not so. Then the following set is consistent. 


X = ((Vzr)(3y)ó(z. y), (3y)6(u, v), 
(Vy Do u)^ó(u, y); 
(Vy Dy u)^o(u, y), 
(Vy 52 u)^9(u, y),...) 


Thus there is a model Mt for X. Let N be the submodel generated by u. This model 
consists of all points which can be reached from u in a finite number of steps. Since 
œ is a modal formula, it is preserved under taking generated submodels and we 
find that N E: o, by which also N f= (3y)ó(u, y). Hence for some w c N, 
N EP (u, w). Since ¢ is constant, it is reflected under generated subframes. So 
M H| (u, w) as well and thus M — (Jy >, u)d(u, y) for some k, since w can 
be reached from u in a finite number of steps, according to the definition of the 
submodel. This, however, contradicts our assumption on 9)t. We conclude that X 
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is inconsistent. Hence, we can write o in the form (Vz)(Jdy P z)é(xz, y) with 
o(x, y) quantifier-free. 


The next problem are the universal quantifiers. Assume that 


a = (Vx)(Vy)(4z)¢(z, y, z). 


We have seen that we can strengthen the existential quantifier to a restricted one, 
keeping the matrix $ constant. Now we consider ^o. This sentence is preserved 
under generated subframes as well. It is of the form (3x)(Jy)v(z,y), where 
w(x, y) is restricted. We can perform a similar argument as above, using v(z, y) 
instead of ¢(x, y, z). Notice, namely, that in the proof we have needed only that 
it is reflected under generated submodels. Thus we can effectively strengthen our 
formula to (Vz)(Vy >, z)(dz be x)$(z,y,z). The last task is to remove the 
negative atomic subformulae. This can be done. According to Theorem 31 of [4] 
since o is closed under p-morphic images it is equivalent to a formula (Vz) (Sy >, 
z)(Vz b, x)v(r,y,z) where v(z, y, z) results from ¢(z, y, z) by replacing all 
negative subformulae by either false or true. 


THEOREM 20 (Proper Display IT) An axiomatic extension of Hilbert-style modal 
logic can be properly displayed (by structural rules over DLM) iff it is axiomati- 
zable by a set of primitive modal axioms. 


8 POLYMODAL LOGICS AND THE DECIDABILITY QUESTION 


The most remarkable aspect of the display calculus is that it generalizes easily to 
logics with several modal operators. Just imagine we have instead of one modal op- 
erator (and its tense dual) a finite list G; of modal operators (possibly together with 
their tense duals ©;). Then we proceed by redefining Gtruc, positing a Gentzen 
toggle e; for each pair G;, ©; and writing down the rules for introducing the oper- 
ators for each operator independently. The conditions (C1) — (C8) are immediately 
satisfied. Notice that the condition (C8) is modular in the sense that only the calculi 
restricted to the individual operators have to be checked for (C'8). If they satisfy it, 
the overall calculus does so, too. We then have the subformula property and cut- 
elimination. Define primitive of an elementary condition as follows. A first-order 
n-modal sentence is primitive if it is if the form (Vx)x where x is produced from 
atomic formulae z = y with the help of ^, V and two way restricted quantifiers 
(Vx Db; y) (Vx « y), (Ax bj y) and (3x <; y) with the extra conditions that no 
universal quantifier is in the scope of an existential quantifier and in each atomic 
subformula x = y at least one of x and y is inherently universal. 
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THEOREM 21 Let $ bea class of Kripke polyframes defined by a finite set of prim- 
itive sentences. Then the logic of $ can be properly displayed. 


If such a logic can be properly displayed we can use axioms of the form A — B 
where both A and B are made from variables with the help of T, A, V, O;, (O;) such 
that A contains a variable only once. Again, we call these axioms primitive. All 
theorems of the preceding section hold in their canonical extension to polymodal 
logics. 


We will use this fact to derive a rather negative result concerning the decidability 
of display logics. One might think that cut-elimination and the subformula prop- 
erty are enough to have decidability — but the subformula property is too weak to 
guarantee that we can give a priori bounds on the lengths of sequents occurring in 
a minimal proof of a given sequent. One might think, then, that it is just a question 
of being clever enough. However, the opposite is true. We will simply prove here 
the following theorem. 


THEOREM 22 It is undecidable whether or not a display calculus is decidable. 


The proof involves the simulation of a Thue-process. We produce here a counterex- 
ample based on a result found together with C. Grefe. Proofs and exact details can 
be found in [5].? Consider a bimodal logic with operators FH and X. We assume that 
both operators satisfy the axiom Altı, so that the relation on the Kripke frames 
satisfies the condition that each point has at most one successor in each relation. 
Suppose further that both satisfy the axiom D, so that in fact each point has ex- 
actly one successor. We will show how to code Thue-processes into such frames 
as extensions of the logic Alt;.D & Alt,.D. It will turn out that the logics are 
undecidable if the corresponding Thue-process is and so we have plenty of unde- 
cidable finitely axiomatizable logics. So, let T = (u; ~ v; | i € n} be a set of 
equations over strings in the alphabet (a, b). Recall that X specifies a relation ~ 
between words as follows. w ~o 2 if w = z; w ~n+1 z iff there is a y such that 
w ~n y andy = Jui, z = yu;y for some i < n or y = yvy, z = Yuiy. Then 
~ = U, ~p. Such equations can be mimicked by modal axioms. Define first a 
translation of strings into formulae. 


a’ = B, b = M, (vw) = vut 
(ui ~ vi) = up + vip 


For example, aab ~ ba gets translated into BH BH Kp + & FB p. Notice that the ax- 
ioms are all equivalent to primitive formulae. It can now be shown that w ~ z iff 


2 The relevant results have actually been known to Alexander Muchnik in 1974, as I have been told 
by Valentin Shehtman. He has not published his findings, though. 
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wtp € z*p is derivable in the logic K. Alt; .D @ K.Alt;.D(&). If T is undecid- 
able the corresponding logic is undecidable as well. Moreover, if T is decidable, 
so is the corresponding logic. Hence, for the logics simulating Thue-processes the 
question of decidability is undecidable. Since they are all properly displayable log- 
ics, Theorem 22 is proved. 


REMARK 23 The proof of the theorem requires at least two modal operators. It 
is not known so far whether for monomodal logics the theorem holds as well. 


9 SPEEDING UP PROOFS 


Despite its theoretic advantages, display logic 1s a rather clumsy tool in actual com- 
putation. Anyone having done or tried a few proofs in display logic will see this. 
We will therefore develop a calculus of compressed proofs that allows to speed up 
a display proof considerably. The way this problem is attacked is by noting that the 
rules of the calculus allow to compress the structures that appear during a proof into 
some smaller structures whose size is bounded a priori from the size of X and Y. 
Recall therefore the definition of z. The following holds in the classical calculus. 


X oI ~~ X 

*(X oY) =~ x*XoxY 
** X x X 

XoX cu X 

XoY ~ Yox 
Xo(YoZ) % (XoY)oZ 


These equivalences are not difficult to show. There might be more valid equations; 
however, it is enough if the ones given hold and so we fix z to be the congruence 
defined by these particular equations. Especially useful is the normal form theorem. 


DEFINITION 24 A structure term is in normal form of rank 0 if it is of the form 
Pio Pp 0...0 Pho *Q 0*Q20...0%*Q, 


where the P;, Q; are formulae; it is called reduced if all P; are different and all Q; 
are different. A structure term is called in normal form of rank n+1 if it is of the 
form 

VoeX;,oeX»2o...eoX,,0oxeY; oxeY20...0*eY, 


where all V, Xi, Y; are in normal form of rank < n. It is called reduced if all X; 
are different and all Y; are different as well, and if V is reduced. 
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THEOREM 25 (Normal Forms) Every term X € Gtruc(L) can be brought effec- 
tively into normal form Y € Gtruc(L). Moreover, it can be brought effectively into 
reduced normal form. 


Let us denote by Comp(L) the algebra Gtruc(L) /. We will call it the algebra of 
compressed C-structures. However, instead of working with compressed struc- 
tures one can also work with reduced normal forms also called reduced structures. 
Notice that normal forms are not unique, but if X, Y are equivalent normal forms 
then they contain the same number of symbols, that is, they are of equal length. A 
reduced structure contains no stacked * unless separated by a e, and * is obligato- 
rily distributed over o. No double occurrences of a structure X are within the same 
nesting of e. 


Define now the layer of a structure as follows. 


A(P) = 0 

X(I) = 0 

A(*X) = A(X) 

A(XoY) = max(A(X),A(Y)) 
A(e X) = 1+X(X) 


We can effectively count the number of reduced structures up to a given layer. The 
bounding number is computed recursively as follows. 


n(0, 7) = 2" 
n(A + 1,7) n(0, y) - 2277) 


PROPOSITION 26 C€omp(6) contains exactly n(A, {G) elements of layer € A. 


Proof. It is enough to count the number of nonequivalent reduced normal forms. 
At layer 0, we have the form 


Pı oP50o...oP,os*sQiosQso...0o*Qi 


Two such forms are equivalent iff the sets of unstarred formulae coincide and the 
sets of starred formulae coincide. There are 27 sets of formulae, where y = #G 
since the generators are assumed different. Thus at layer 0 we have exactly 27 x 
27 = 2?!G elements. Suppose then that the elements of layer À are counted by 
n(A, iG). At layer À + 1, every element is of the form 


X oeYj oeYn...00Y, okxeZi0ox*eZ20...0*9Z, 
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where X, the Y; and the Z; are of layer A. The Y; are all distinct as well as the Z;, 
though the X need not be distinct from the Y; and the Y; need not be distinct from 
the Z;. Counting the number of such elements gives n(0, 4G) - 227#@), a 


Let us define the compressed calculus to be the display-calculus read as a cal- 
culus over Comp rather than truc. A proof II in the display calculus is translated 
into a compressed proof II^ line-by-line, by translating X - Y to X/x H Y/e. 
A compressed proof can be uncompressed by translating a sequent X / zz + Y / z 
into a sequent X’ + Y' where X' = X and Y’ z Y and then adding some more 
proof steps. Although the representative can be chosen at random, there is always a 
way to complete this proof — by definition of zz. We can, however, discern among 
these choices of uncompressions some good ones that make use only of reduced 
formulae. So letu : Comp(L) — Gtruc(L) a map with u(X/ z) z X, sou 
is picking representatives from each class. u is called special if u( X) is always 
reduced. From now on u is assumed to be always special. Let (II^)" be the line- 
by-line translation of II* by u. As observed several times, (II^)" is not necessarily 
a proof in the strict sense, but there is a way to complete the uncompressed version. 
Namely if 

Xi FY ... Xn F Y, 
V-W 


is a line in the uncompressed proof, then it is of the form p = o*" where ø is an 
instance of a display-rule. So ø is of the form 


XHY! ... XHY! 
V'F W' 


O 


where X; ~ X;, Y; ~ Y;, V = V' and W = W'. We can then first move from 
X; F Y; to X; + Y; then conclude V’ + W' and the go back to V - W. 


Xi-Y] X FY, 
X FY, ... Xn Y, 
V-W 
V'EW' 
But this proof is at least as complicated as II itself, so this does not amount to a re- 


duction in any sense. The most obvious waste that is produced this way is by com- 
pleting steps that have been trivialized by this forth-and-back translation. Namely, 
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the rules (A), (P), (C) are now completely empty. For example, (X o X)" = X*", 
so if p is an instance of (C) then p°“ = id, the identity transition. Evidently, we 
can get away by just forgetting about this rule altogether. 


But there is a more economical way to complete the proof which goes as follows. 
Take, for example, the rule (+ ^). 


XHP XFQ 
XoX- PAQ 


Let there be an instance of this rule and let us compress and uncompress this rule. 
Then we get 


X"FP X"FQ 
X F PAQ 


There is no need to return to X | P from X*" + P orto X + Q from X** F Q. 
The rule can be applied directly to the new premisses but it yields the conclusion 
X**oX*"-PAQrather than X^" | P A Q. However, a single application of 
(Cl) brings the sequent into reduced form. In general, the antecedents are of the 
form 


f(Xi,..., Xx) H g(X,..., Xk) 


where the X; need not be different, but occur at most once in f and g. Let us write 
f(X) instead of f(X1,..., Xx), and f(X*") for f(Xf",..., Xg“). We know that 


f(X)™ « f(X™) 


as well as 
g(X)™ ~ g(X*") 
by the fact that zz is a congruence. Moreover, f (X) is structurally similar to f (X ^") 


and g( X) is similar to g(X cu), Thus the rule in question can be applied to the pre- 
misses 


f(X^) (X) 


Thus if the following line occurred in IT 


AQG)raQ ... fOD Fg (D) 
r(X) - s(X) 
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then we will replace it by the following proof: 


AO gi (X) f(X)** E gn(X)™ 


h)Fg(X) ... fa (X9) F gal Xe“) 
r(X*") - s(Xev) 


r(X) -'s(X) 

The starting sequents contain reduced structures and immediately before applying 
the rule we have structures built from at most k occurrences of reduced structures. 
So it has no more symbols than a term of k occurrences of reduced structures. This 
k varies from rule to rule. But we can fix € as the maximum of all k in our cal- 
culus. Moreover, we can let R be a bit smaller than that, namely the maximum of 
occurences of variables in an antecedent or succedent of a rule. Furthermore, rules 
that are emptied by compression can be left out of consideration. At present, K = 2. 


(For (A), where K = 3, the forth-and-back translation makes the rule void, so we 
do not have to care about it.) 


It should be clear that the number of steps filling the dots in the proof is bounded 
a priori; one can namely give exact bounds as to how many steps are needed to 
reduce a structure, and similarly how many are needed to mediate between the se- 
quent X H- Z andY F- Z when X z Y. In order to supply a rigorous argument here 
we give bounds on the size of intermediate sequents that need occur when passing 
from X F Z to Y F Z. First, let W be reduced and W = X. Then it is enough to 
bound the size of sequents for a transformation X H Z ~ W F Z. The crux is that 
one cannot simply count the symbols in X F- Z and hope that one never needs to use 
sequents that are longer. Namely, intermediate calculations may involve display- 
ing — and displaying usually means an increase in the length of the structure. Define 
therefore first the symbol count sc(X ) (sc(.X l- Y )) to be the function counting ev- 
ery occurring symbol except brackets (and F). Then let jj be an n-ary function and 
b; it's left or right dual for the j*^ position. Define the display factor V as follows. 


sc(b;(X1,..., Xq)) 
sc(t(X1,...,Xn)) 


By induction it is proved that 


V := maxi | # an n-ary function symbol and j € n] 


se(X + f?(W)) 


MS NPCS FRYE WY 


| f aunary structure polynomial} 


Notice that * is included in the definition of V if * counts as it’s own dual. The 
factor V then gives an upper bound for the cost of displaying material at the same 
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side. It is, however, also a bound for displaying at the other side. Namely, 


sc(«X FY) _ 
SUPA GY EX) | X,Y € Gtruc} —1 
as well as (X FY) 
SC 
—— | X,Y =1 
sup sc(x * X FY) |X, Y € Štruc} 


Notice that we do not have to consider the quotient sc(+ « X + Y)/sc(X F Y) 
because we are interested in removing * from a symbol not adding it. This allows 
to have V — 3/2 for the current calculus rather than V — 2. 


The algorithm for reducing a sequent consists in displaying structure parts for 


which a reduction can be applied, applying reduction rules and then undoing the 
display. Thus, while the reduction parts will actually not increase the length of the 
sequents it is the display strategy that can increase the length of the structure by a 
factor V. But when we undisplay, the resulting sequent will not be longer than the 
one we started off with. Thus, the only price to be payed is an increase in the length 
of occurring structures by the factor V. 
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HEINRICH WANSING 


A PROOF-THEORETIC PROOF OF FUNCTIONAL 
COMPLETENESS FOR MANY MODAL AND 
TENSE LOGICS 


] INTRODUCTION 


Suppose £ is a propositional logic with a finite set & of primitive finitary opera- 
tions. Moreover, assume that S is a (strongly) sound and complete semantics for £, 
and Qis the class of all finitary operations explicitly definable in S. It may be that 
Q is defined by imposing some natural constraints on semantical models, if 5 is a 
model-theoretic semantics. Of course, these constraints must leave the connectives 
in X explicitly definable from finite combinations of connectives in Q. The problem 
of functional completeness for £ consists in finding a proper subclass I‘ of 2 such 
that every connective in (? is explicitly definable by a finite number of compositions 
from the elements of I’. In a proof-theoretic semantics for L, the class 2. of permis- 
sible connectives is given by general schemata for introducing finitary operations 
into premises and conclusions; see, for instance, [6], [7], [9], [10], [11] for proof- 
theoretic proofs of functional completeness for minimal propositional logic, intu- 
itionistic propositional logic, Nelson's constructive propositional logics and vari- 
ous substructural subsystems of these logics. 


It seems, however, that this approach cannot directly be applied to the standard 
Gentzen-style proof systems for tense or modal logic. The reason is that the op- 
erational rules of these proof systems fail to be explicit or even weakly explicit. 
Ohnishi’s and Matsumoto's [8] right rule for O in their sequent system for S5, for 
instance, not only exhibits O on both sides of the sequent arrow in the conclusion 
sequent, but even exhibits O in the premise sequent: 


OA > OPr,ArFoOA—-dr,oA 


(here A, T' range over finite sets of formulas and OA = {OA | A € AJ). It is not 
clear how such rules could be captured by a proof-theoretic semantics in terms of 
introduction schemata. 
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In the present paper, it is shown that the proof-theoretic approach towards func- 
tional completeness can be applied to normal propositional modal and tense logics, 
if these systems are presentable as systems of (a suitable version of) Belnap’s dis- 
play logic (DL), see [1], [2], [12], [13]. Due to an enriched structural language in 
display logic, G (“always in the future", alias O) and H (“always in the past") can be 
given explicit introduction rules in a display calculus L.A, for the minimal normal 
tense logic K+. Moreover, many important axiom schemata like D, T, 4, 5, and 
B can be expressed as purely structural rules, which results in a modular Gentzen- 
style proof theory for the most important systems of modal and basic tense logic. 
We shall use DL to define a natural proof-theoretic semantics in terms of general 
introduction schemata. The set of connectives 6 = {G, H, 5, A} will turn out to 
be functionally complete for K+, and V = {O, 2, A} will be shown to be function- 
ally complete for K, relative to straightforwardly restricted introduction schemata. 
The demonstration of functional completeness is such that it immediately reveals 
$ and W to be functionally complete for any displayable extension of K; and K, 
respectively.! Since the systems to be dealt with are based on classical propo- 
sitional logic, there is no need to consider higher-level sequents as, for example, 
in the case of intuitionistic logic, cf. [6], [9]. Moreover, in contrast to the proof- 
theoretic semantics for substructural logics in [10], [11] and [14], we need only 
one right and one left introduction schema. 


2 DISPLAY LOGIC 


Display logic is a Gentzen-style proof-theoretic framework using a generalized 
structural language. This language comprises four structural connectives, the null- 
ary I, the unary operations * and e, and the binary o. Every (tense logical or modal) 
formula is considered to be a structure, and the structural connectives are used to 
build up more complex structures. We shall use X, Y, Z, X, X» etc. to denote 
arbitrary structures and A, B, C etc. to denote arbitrary formulas. The structural 
language can be motivated by general considerations on what would be desirable 
and natural, if one is dealing with a deducibility relation — between possibly com- 
plex data and goals (cf. [4]). In this context, the structural connectives can be un- 
derstood as follows: 


— Iis the empty structure. 


1 This notion of functional completeness must, of course, not be confused with the notion of temporal 
completeness over a class of Kripke frames, cf. [3, p. 116 f.]. However, there is a connection. Whereas 
certain natural temporal operations like “until”, which are first-order definable on Kripke frames, cannot 
explicitly be defined from G, H, and the Boolean connectives, one may ask for a semantics with respect 
to which G, H, and the Boolean connectives turn out to be functionally complete. Such a semantics is 
presented in the present paper. 
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— o is structure addition. . 
— * shifts structures from one side of — to the other. 


— e is an intensionality mark switching from one side of — to the other. 


The structural meaning of the structural connectives becomes clear from the basic 
structural rules of DL. | 


Basic structural rules: 


10) XoY 2OZd- X 2 ZoY*4+Y 2 X*oZ 
(2 XoOYoZJ-XoZ*2Yd-Y'oX AZ 
(3) X2YdFY'2X*J-X2Y* 

(4) X Y d- eX >Y, 


where X, — Y; JF- Xə > Y3abbreviates X, > Yı F Xə — Yo and X — 
Y> F X1 — Yi. If two sequents are interderivable by means of (1) - (4), then these 
sequents are said to be structurally or display equivalent. 'The following pairs of 
sequents are structurally equivalent on the strength of (1) - (3): 


XoY => Z Z*—Y*'oX* 
XoYoZ Z*oY* + X* 
XY X* >Y 
X* >Y Y* +> X 
X — Y* Y > X*. 


This ‘geometry of structures’ is suitable for formulating introduction rules for quite 
a few logical operations known from the field of substructural logic. Since in this 
paper we are interested in normal tense and modal logics extending classical propo- 
sitional logic, we shall, however, assume further structural inference rules which 
make void distinctions between logical operations that otherwise would be at our 
disposal. 
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Additional structural rules: 


I+) XoZFtloX oZ 
XZ-r XoIoZ 
(I-) IoXoZ-XoZ 
Xol—oZk-HXoZz 
(ler) I> XĀF-Z>X 
(ec0) X>IFX>Z 
(A) Xı o (X20 X3) > Z Ak (X10 Xo) 0X3 > Z 
(P) XioX:> ZF- X2oX1 Z 
(C) XoX 9ZEFX AZ 
(M) X> ZEX10X%, 9 Z 
X1 Z- X2o0X12Z 
(MN) I—-bXFreI—X 
I—^XFre(X*)2T. 
This menu of structural rules allows one to dispense with rules like permutation 
on the right hand side of —, which would make the calculus more symmetric. The 
following context-sensitive reading of the structural connectives gives rise to the 


introduction rules for the usual logical operations of propositional modal and sim- 
ple tense logic: 


structural connective 
Lo 10 | 0 | 
P “sometimes in the past” | G (0) 


(290 XoIFX0 
(0>) F0I 
(21) FI—1 
(12) I—^XFr1oxX 


(23923) X> A'-X2-^A 
(a+) ATAXFAASX 
(> G) eX => A-X 9 GA 
(Gs) A—^XFr- GA eX 
(>F) X-—AF(e(X*)' FA 
(F >) (e(A*)'!—YFFAoY 
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(2H) X > (e(A*))*+ X HA 
(H >) A—^XFHA- (0(X*))* 
(> P) Xo AFeX— PA 
(P>) A—eXrFPAX 


(9A) XA YOBEX0Y SAAB 
(A>) AoBAXFANBOX 


(>v) Xo AoBFX-—AVB 
(V>) A—^X BoOYFAVBOXoY 


(9D) XoA—BFX-—ADB 
) XA B—-5YFAD2B29 X*oY. 


Finally, there are two 


Logical rules:” 
(Il FA-4A and (Cu) X24 AS2Y-HXOY. 


Let us refer to the above sequent system as Lz, and let us refer to the result of 
dropping the rules for H and P from LK; as LK ? 


Display logic derives its name from the fact that any substructure of a given se- 
quent s may be displayed as the entire antecedent or succedent, respectively, of a 
structurally equivalent sequent s'. In order to state this fact precisely, we need a few 
definitions. An occurrence of a substructure in a given structure is said to be posi- 
tive (negative), if it is in the scope of an even (uneven) number of *'s. The structure 
X (Y) is called the antecedent (succedent) of X — Y. An antecedent (succedent) 
part of a sequent X — Y is a positive occurrence of a substructure of X or a neg- 
ative occurrence of a substructure of Y (a positive occurrence of a substructure of 
Y ora negative occurrence of a substructure of X). One can then prove (see [1, p. 
381]: 


THEOREM 1 (Display Theorem) For every sequent s and every antecedent (suc- 
cedent) part X of s there exists a sequent s' structurally equivalent with s, such 
that X is the antecedent (succedent) of s'. 


Next, consider the following translation T of sequents into (tense logical) formulas: 


T(X —Y)-2 n(X) DRY), 


?It would be enough to require + p — p, for propositional variables p. (Id) can then be proved by 
induction on A. 
3Thus, LK here does not denote the standard sequent system for classical predicate logic. 
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where 7; (i = 1, 2) is defined as follows: 


7;(A) — A 

Ti (I) = 1 

T2(T) = 0 

™1(X*) = on(X) 
Tr(X* = n(X) 
m(XoY) = n(X)^m(Y) 
T((XoY) = 2(X)VR(Y) 
T1 (eX) = Pr (X) 

T2 (eX) = Gr(X). 


THEOREM 2 (i) If - A in Ki, then- Y > Ain LK.. (ii) If X — Y in LK, 
then- T(X — Y) in Ki. 


Proof. See [12], [13]. E 


COROLLARY 3 (i) Ain K, iff- I— Ain LK.. (i)- Ain K iff} I — Ain 
LK. 


Proof. (i) By the previous theorem. (ii) By the obvious fact that each normal propo- 
sitional tense logic which is complete with respect to a class of Kripke frames is a 
conservative extension of its fragment in (0, 1, G, F, 5, A, V, >}.4 E 


A normal propositional tense (modal) logic S is said to be properly displayable 
if (i) S can be presented as a calculus of sequents which satisfies certain conditions 
ensuring the subformula-property and the eliminability of Cut (see [1], [15]), and 
(ii) S is the result of extending L.A, (LK) by purely structural inference rules. The 
class of all properly displayable normal propositional tense logics has recently been 
characterized by Marcus Kracht [5]. In fact, many important axiom schemata can 
be captured by purely structural rules of inference, thereby giving us a modular 
sequent-style proof theory for extensions of K; and K. If R is any of these axiom 
schemata, we shall associate with R a structural inference rule R’, for instance: 
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eex — Yrex —Y 


(e( X” =Y XY 
H e((s(X*))*) ^ Y 
((X*)'—5Y XY 
Ft (e((eX)*)* + Y 


Let A be the set of all of the above axiom schemata R and T C A. Then I" = 
{R| RET}. 


THEOREM 4 (i) In LK,UI', - I Aiff- Ain K; UT. (ii) Cut is admissible 
in LK; UI". 


Proof. (1): see [12], [13]; (ii): see [1]. a 


3 PROOF-THEORETIC SEMANTICS 


We shall assume the structural language, the logical rules, and the (basic and addi- 
tional) structural rules of the previous section. This is our ‘structural framework’; 
and we will refer to it as B. The idea of the proof-theoretic semantics is to delimit 
the class of admissible connectives by specifying general schemata for introducing 
(finitary) connectives of a propositional language L into premises and conclusions, 
that is, on both sides of the sequent arrow. Of course, these schemata must not be 
arbitrary, cf. [6]. A rule schema for an n-ary connective F should exhibit no other 
connective than F’, and the deductive role of formulas F(A,,..., An) should de- 
pend only on the deductive relationships among the parameters A;,..., An. More- 
over, the schemata should be conservative (or non-creative), that is, each proof of 
an F-free formula A should be convertible into a proof of A with no applications 
of rules characterizing F. This condition boils down to the requirement that ap- 
plications of cut, in which both premise sequents are derived by introducing the 
cut-formula F(A1,..., An), can be eliminated. 


DEFINITION 5 The general right introduction schema is this: 


(2 F) X1 Yi... XX —9 Y, F X10o...0 X, > F(Ai,..., An), 


4F is the operation “sometimes in the future". 
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where every X; (i = 1,...,k) is an unspecified structure, every Y; contains only 
formulas from A, ..., An, and every Aj (j = 0,...,n) occurs in some Y;. 


We have to state a left introduction schema such that the eliminability constraint 
is satisfied. In order to be able to succinctly formulate such a schema, we define 
certain operations on sequents. 


DEFINITION 6 The operations (-)', (-)" on sequents are simultaneously defined 
as follows: 


IZ ifY =I 

AZ ifY —A 
(YoZ)'=<{ (WZ)! ifY = eW 

(Z> WY if Y =W* 


(Wi > Zi)! (W2 > Za) ifY = Wio W2 


Z>I ifY =I 
ZA ifY = A 
(Z4Y)"=< (Z> WY if Y = eW 
(W > Z) if Y = W* 


(21 — Wi)" (Z2 — Wa) if Y —2Wio Wa, 
where Z, Z1, and Zz are unspecified structures. 


Clearly (Y — Z) is a sequence of sequents each of which has either the shape 
A > X,X > A,I  X,or X — I. Moreover, for every occurrence of a 
formula A in Y, there is exactly one sequent A — X or X > Ain (Y — Z)'. We 
shall call X the structure corresponding to this occurrence of A in (Y — Z)’. 


DEFINITION 7 The general left introduction schema is: 


(F -) (Y — Wwy H F(Ai,..., An) — (Yi)! 


(Y 4 W) H F(Aj,... , An) — (Yp), 


where W is an unspecified structure and (Y;)* is the result of replacing every occur- 
rence of a formula in Y; by its corresponding structure in (Y; — W y. 


EXAMPLE 8 Consider the following instantiation of (> F): 


X = (A oeB)oe(I*)t X > F(A, B). 
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We have 


((A* o eB) o e(I*) 2 W)! 

(A* o eB 2 Wi)! (e(1I*) 2 W2)' 

(A* — W, )' (eB — Wi, yl (I* — W3)! 
(Wi, > Ay (B > Wy)! (W2 > I)" 
W, > AB>W,W>I 


and obtain the following instantiation of (F +): 
W,,7~ ABO Wi W IF F(A,B) > (Wy, o «W,) o e(I*). 
We must verify that the eliminability constraint is indeed satisfied. 
OBSERVATION 9 For every proof 


X12 Y eX.) > Yk (Y; 2 W)! 
(1) X,0...0X, > F(Ai,..., An) (2) F(Ai..., An) > (Yi)! 
(3) X10...0 X, > (Y;) 


in which the cut-formula F(A;,..., An) is introduced in the inferences ending in 
(1) and (2) and in which (3) is not identical to one of (1) or (2), there is a proof 
of (3) in B + (— F) + (F —) from the premises of (1) and (2) in which every 
cut-fromula of any application of cut is a proper subformula of F(A1,..., An). 


Proof. Suppose (Y; > W)! = Si, ...8:,,. Consider the sequent s;,. If s;, = 
A — V or V — A, then it is clear from the definition of (-)' and (-)” that a certain 
occurrence of A in Y; is a succedent part of X; — Y; or an antecedent part of X; — 
Yi, respectively. We transform X; — Y; into a structurally equivalent sequent U — 
Aor A — U, respectively, and then apply cut to obtain U —> V or V — U, 
respectively. Then we iterate this process for s;,,..., Si„, (if possible) and finally 
obtain X; > (Y;)'. Applying (M) gives X4 o...o X, > (Y;)*. a 


EXAMPLE 4 (continued) We show in some detail how the procedure is applied. 
Our starting point is the following application of cut: 


X > (A* oeB)oe(I*) Wi, 7A BOW, W.-!I 
X^ FB) F(A,B) > (Wy o W) oe) 
X — (Wi, o eW1,) o e(I*) 
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We first display A in X — (A* o eB) o e(I*) and use it as cut-formula: 


X — (A* o eB) o e(I*) 

X o (e(I*))* 4 A* oeB 

(X o (e(1*))*) o (eB)* ^ A" 

Wi 3A  A-*((X o(e(I*))*) o (eB)*)* 
Wi, 2 ((X o («(D))*) o (SB) 


Then we display B in W1, — ((X o (e(1*))*) o (eB)*)* and use it as cut-formula: 


Wi, > ((X o («(I'))*) o (eB)*)* 

(X o (e(I*))*) o (eB)* 2 Wy, 

X o (e(I*))* 2 Wr, ° eB 

Wi, o (X o(e(I*))*) > «B 

e(Wi, o(Xo(«I')))B BW, 
e(Wi, o (X o (e(1'))*)) > Wi, 


The conclusion sequent we have reached is structurally equivalent with the sequent 
we wanted to derive, viz. X — (Wy, o eW1,) o e(I*). 


One might be inclined to object that the schema (— F) is not general enough, 
since it permits only connectives with exactly one right introduction rule. This is, 
however, no real restriction, since multiple right introduction rules can be inter- 
preted disjunctively. We shall illustrate this by means of a simple example.? 


EXAMPLE 10 Consider the four-place connective «1 with the following right in- 
troduction rules: 


(><) X1, >A Xi, > BEX, 0M, 24 (A,B,C,D) 
X24 >C X25 + Dr X2 o X», >< (A, B,C, D). 


(— «1) is interreplaceable with the single rule 


(2ay X>AoC X2 AoD X2 BoC X+BoDt 
- X =< (A,B,C, D). 


51f there is more than one right introduction rule, it seems to be natural to require that every A; 
(J = 0,..., 7) occurs in a premise sequent of some right rule. 
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(=<) > (— «)': Consider the following derivation: 


X> AoC X-BoC 
XoC*> A XoC*>B 

(X 0 C*) 0 (X 0 C*) 2 F(A,B,C,D) 
X oC* > F(A, B,C, D) 

X > F(A, B,C,D)oC 

F(A, B,C,D)* 0X 9C 


Similarly we derive F'(A, B, C, D)* o X — D and continue as follows: 


F(A,B,C,D)*oX >C F(A,B,C,D)* oX >D 

(F(A, B,C, D)* o X)o (F(A, B,C, D)* o X) > F(A, B,C, D) 
F(A, B,C, D)* o X 29 F(A, B,C, D) 

F(A, B,C, D)* => F(A, B,C, D) o X* 

F(A, B,C, D)* o F(A, B,C, D)* + X* 

F(A, B,C, D)* + X* 

X 2 F(A, B,C,D). 


(2.4) > (=<): Consider the following derivations: 


X1, À X1, B 

X1, o X1, >A Xi, o Xi, +B 

(Xi, o X1,)o C* — A (X1, o X1,)oC* +B 
Xi, 0 X1, 9 AoC X1, 0 X1, + BoC. 


Similarly we derive X1, o X4, > Ao D and Xj, o Xj, — B o D. Applying 
(+<1)' gives Xi, o X1, > F(A, B,C,D). 


Note that (— «1) is also interreplaceable with 


(9a)! X1— AoC X, AoD X33 BoC X4, BoDF- 
FE Xı 0 X20 X30 X4 4 (A, B, C,D). 


For the proof of functional completeness, we shall need a replacement theorem. 
Let CA denote an L-formula, which contains a certain occurrence of A as a subfor- 
mula, and let Cg denote the result of replacing this occurrence of A in C by B. We 
use [B/ A| X to denote the result of replacing every occurrence of A in the structure 
X by B. The degree of A (d(A)) is defined as the number of occurrences of propo- 
sitional connectives in A. Lett X € Y abbreviate- X Y and Y > X. 
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THEOREM 11 ff- Aw BinB+ (> F)+(F -), than- C4 o Cp inB + 
(2 F)+(F >). 


Proof. If d(C4) — d(A) = 0, there is nothing to prove. Suppose therefore that 
d(CA) — d(A) > 0. Then C4 has the form F(A,,...,An)a. For every i = 
1,...,k, we consider a certain instantiation of (Y; — WY)! = sj, ...si,.. If $i, 
has the shape A + V or V > A (q = 1,...,m), we instantiate V by B. If si, 
has the shape I — V or V — I, we instantiate V by po p", for some propositional 
variable p. If s;, has the shape D + V or V — D and A # D, we instantiate V by 
D. Obviously, every instantiation of any s;, is provable. Therefore by (F —) we 
obtain F F(A1,..., An)a > [B/A]Y;, and by (> F) together with (C) it follows 
that - C4 — Cg. Analogously, we can show - Cg > Ca. a 


4 FUNCTIONAL COMPLETENESS 


We shall first show that 6’ = {0, 1, ~, G, P, A, V) is functionally complete for K; 
and then prove functional completeness of Y’ = {0,1,7,G, A, V) for K. Func- 
tional completeness of ® for K+ and Y for K follows from standard definitions. By 
induction on the complexity of X , we can prove 


LEMMA 12 In LK;: (i) - X > n(X). (ii) + r(X) > X. (iii)- X > Y 
implies - (X) — Y. (iv) Y — X implies- Y > r2(X). 


THEOREM 13 4' is functionally complete for K+. 


Proof. Suppose that the rules for F` are instantiations of (—> F) and (F —). Then 


F r4(Yi) > Yi by Lemma 12 (ii) 
— FT(Yi) o...oT2(Y.) > F(A,..., An) by (> F) 
=> FHFnlrnlYi)o...oT2(Yk)) 9 F(A1,..., An) by Lemma 12 (iii) 

Moreover, 

H F(41,..., An) > Yi by (Id) and (F —) 
=> HkF(A,...,A4) 2 72(Yi) by Lemma 12 (iv) 
=> | F(A,,...,An) > (Yi) o...o T2(Yk) by (M) and display 

equivalence 


=> + F(A,,...,An) > 1(72(¥1)0...°72(¥,)) by Lemma 12 (i) 
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By the previous theorem, it follows that F(A,,..., An) and rj (r2(Yi)o. . .oT2(Y;)) 
are provably interreplaceable. 


It still has to be shown that for every primitive operation ł of K+, either the rules 
for f are (interreplaceable with) instantiations of (+ F) and (F —) or fj is explic- 
itly definable by a finite composition of operations from ©’. This can be verified 
by eye and the following straightforward observations:(i) (0. —) is interreplace- 
able with (0 — I X + 0 > I (ii) (7 —) is interreplaceable with 
(o) X +> AF AA — X*. (iii) The ‘multiplicative’ rules for ^ are in- 
terreplaceable with their ‘additive’ formulations: 


(>^) XA X>ABEX- (ANB) 
(A+) A—XF(AAB)—^X 
B XF(AA^AB)— X. 


(iv) The rules for D are interreplaceable with: 


(~D) Xo A*oBt X 5 (A B) 
(252) A*—X BOYFADBO XY. 


(v) 1 is definable as (p V —^p) for some propositional variable p. (vi) FA is definable 
as ~G-—A; HA is definable as -P—4A. In the logical language of K+, we thus have 
FX- Ain K, iff- X 2 AinB+ (> F) + (F 9). E 


EXAMPLE 4 (continued) We get (~A V GB) V G-1 as definiens for F(A, B). 


THEOREM 14 V' is functionally complete for K. 


Proof. The proof is completely analogous to the proof for K; except that now the 
schemata (— F) and (F —) have to be suitably restricted: in every Y; the struc- 
tural connective e may occur only within the scope of an even number of *'s. Then 
the definiens 71 (2(Y1) o... o 72(Y;)) remains in the language of modal logic. Wi 


The cut rule remains eliminable, if the conditions on inference rules defining 
properly displayable logics are slightly generalized, so as to allow to present, for 
example, classical linear propositional logic as a cut-free display calculus, see [2], 
[15]. If S (1) satisfies these more general conditions, and (ii) S is an extension of 
LK, (LK) by purely structural rules, then 5S is said to be a displayable normal 
propositional tense (modal) logic. 
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COROLLARY 15 (i) 9 is functionally complete for every displayable normal 
propositional tense logic.(ii) Y is functionally complete for every displayable nor- 
mal propositional modal logic. 


Proof. Adding purely structural rules does not affect the above proofs of functional 
completeness. E 
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RAJEEV GORE 


ON THE COMPLETENESS OF CLASSICAL MODAL 
DISPLAY LOGIC 


1 COMPLETENESS OF CMDL 


We wish to prove that the CMDL system of Wansing [2] and Kracht [1] is sound 
and complete with respect to the tense logic Kt. Fortunately the direction corre- 
sponding to soundness of CMDL is easy, leaving us with the task of proving com- 
pleteness. 


The known proofs of completeness for CMDL proceed via the following trans- 
lation 7 of structures (denoted by X, Y and Z) into formulae (denoted by A): 


(XE Y) = (n(X) 2 n(Y)) 


71 (A) = A T2(A) = A 

Ti (1) = T T2(L) = l1 

ri (eX) = $7:(X) T2(@X ) =  Un(X) 

Ti (+X) = ^ (X) T2(*X) = -n(X) 

n(X oY) = 1(X)A(Y) T2(X oY) = T2(X) V (Y) 


With respect to this translation, the statement of completeness is: if 7(X + Y) 
is a theorem of Kt then X F Y is provable in CMDL. 


Note that completeness here is stated at the level of arbitrary structures X and 
Y, not arbitrary formulae as in the original proof of Wansing recorded below as 
Lemma 1. 


Kracht [1] has already proved completeness at the level of structures, but his 
proof is quite complicated involving not only a reference to Hilbert characterisa- 
tions of classical modal logics, but also the addition of extra rules which then have 
to be shown to be admissible. Here we give a greatly simplified and completely 
syntactic proof of the completeness of CMDL at the level of structures. 


137 


H. Wansing (ed.), Proof Theory of Modal Logic, 137-140. 
© 1996 Kluwer Academic Publishers. 


138 RAJEEV GORE 


LEMMA 1 The sequent I + A is provable in CMDL iff the formula A is a theorem 
of the tense logic Kt. See Wansing [2]. 


In particular, if we know that the formula 7(X + Y ) is a theorem of tense logic 
Kt then we know that I + 7; (X) — 72(Y) is provable in CMDL. 


LEMMA 2 If I - (X) — 72(Y) is provable in CMDL then T, (X) + n(Y) is 
also provable in CMDL. 


Proof. Below, (dp) stands for "display postulates". 


n(X)Fmn(X) (YY) (Y) ab) 

IF-n(X) nY) n(X) > (Y) F #71 (X) 0 72(Y) (cut) 
IF #7(X) 072(Y) 

Iom(X)F n(Y) 


I F 
n (X) F To(Y ) ) 


LEMMA 3 For any structure Z both Z + 1(Z) and r2(Z) + Z are provable in 
CMDL. 


Proof. By simultaneous induction on the structure of Z. The lemma holds for all 
formulae A since 7; (4) = 72(A) = A. Similarly, it holds for J since 4 (7) = T 
and r2(I) = L, and both J - T and L F I are provable in CMDL by the rules 
(F T) and (L F) respectively. 


Suppose the lemma holds for all structures of degree k and suppose Z is of de- 
gree k + 1. We consider each case in turn depending on the outermost connective 
of Z. The induction hypothesis is identified by I.H. below: 


I. H. 
I. H. I. H. r(X)- X I. H. 
Xr n(X) YE n(Y) *X F *T2( X) Xk n(X) 
XoY - n(X)An(Y) &X F aT2(X) eX + 9r (X) 


T1 T: 
X oY - n(X oY) *X + mn (*X) eX | r; (eX) 
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I. H. 

I. H. I. H. X F n(X) LH. 
T2(X)F X T2(Y) FY *n(X) b «X rTY(X)- X 
T2(X) V r2(Y) XoY An(X)F «X Or2(X) bk eX 
-----lIT-ADaADC T) | --------(m)  --------(7) 

T((XoY)E- XoY To(*X) - «X T2(eX)F eX 


THEOREM 4 If7(X l- Y)isatheoremoftense logic Kt then any sequent X LY 
with T(X + Y) = r(X F- Y) is provable in CMDL. In particular, the sequent 
X -Y is provable in CMDL. 


Proof. First note that if T(X F- Y) = T(X + Y)then n(X) = n(X) and 
TY) = Te (Y ) thus giving us one line proofs of 7; (X) H n (X) and r(Y) H 
r4(Y) in CMDL (depicted as note 1 below). Second, the fact that (X + Y) isa 
theorem of tense logic Kt, together with Lemmas 1 and 2 gives us that 74 (X) H 
T2 (Y ) is provable in CMDL (depicted as note 2 below). Now we can complete the 
proof by constructing a CMDL proof of X t- Y as follows (using the cut rule which 
we know to be eliminable): 


note 2 note 1 
n(X)F (Y) nr(Y)F 72(Y) Lemma 3 
note 1 n(X)F r(Y) r(Y)r Y 
Lemma 3 n(X)-nX) . ^ ^ WFF 
Xt n(X) n(X)rY 


X-Y 


Lemma 3 is also at the heart of Kracht’s proof for it allows us to derive his extra 
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(inverted) operational rules by using cut. For example: 


Lemma 3 E A ^ BEX (n XF DA Lemma 3 
AoBkn(AoB) n(Ao B)- X X + T;( eA) T;( eA) - eA 
AoBFX X F- «A 

derivation of Krachts's (o F) derivation of Krachts's (I- e) 
The point is that each structural connective (in the appropriate position) is an exact 
proxy for the logical connective it replaces. H 
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CLAUDIO CERRATO 


MODAL SEQUENTS 


1 INTRODUCTION 


We present some results about non-standard calculi of sequents for normal modal 
logics (NLs). While for standard calculi the attention is on possible extensions to 
more complex NLs (e.g., see Goré [9]), or on searching for a unified theory, like 
the subformula property (as explored in Takano [18]), non-standard calculi have 
been developed by changing the rules of the game, searching an unified view by 
modifying the structure of the calculus. 


A first line is outlined (for different purposes) in Došen [6], where higher lev- 
els were introduced to control metalinguistic behaviours. In Cerrato [1, 2, 4], the 
attention is focussed on developing an uniform treatment of sequent calculi for the 
main 15 NLs between K and S5, namely for K, KB, KD, KT (=T), K4, K5, KBD, 
KBT (=B), KB4, KD4, KD5, KD45, K45, KT4 (2S4), KTS (2S5). Techniques 
are different, but the trend is the same: to find a structure that work uniformly well 
for all those NLs. In sequent with modal signs (see [1]) metalinguistic signs were 
added to separate modal behaviours from propositional ones; the treatment is uni- 
form, but it does not seem to work well for cut-elimination (as pointed out by Goré 
[10]). Semantic modal sequents directly introduce Kripke accessibility relation into 
the structure of the calculus, leading to an uniform treatment that is cut-free for all 
those NLs. The semantic proof exhibited (following the line of Girard [8]) bypasses 
the problem of syntactic cut-elimination, that is afforded by modal tree-sequents, 
in [4]: those sequents are a variant of semantic modal-sequents, where the tree- 
structure remains, but the accessibility is absorbed into peculiar modal rules; syn- 
tactic cut-elimination is proved by extending the usual proof (see Takeuti [17]) to 
modal sequents. 


In this work, the language is L = (P, A, V, ^, +, 01); we define the other op- 
erators as follows: 


equivalence A + B def (A> B)A(B >A) 
possibility QA =def -0-A 

strict implication A — B =def O(A > B) 

strict equivalence A < B =def (A> B)A(B= A) 
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Furthermore, T and L denote a generic theorem and a generic contradiction, re- 
spectively. Actually, for sequents with modal signs we use possibility as primitive 
operator, since its dual role w.r.t. necessity is managed by specific rules. 


As to notation, we use A to refer to a system among K, KB, KD, KT (=T), K4, 
K5, KBD, KBT (=B), KB4, KD4, KD5, KD45, K45, KT4 (=S4), KTS (2S5) (see 
Chellas [5]). We exhibit a basic calculus for K, and specific rules corresponding to 
axioms T, 4,5, DB, D, 


T: [JA A 
4: DA-OOA 
5: OA 09A 
B: A UQ9A 
D: DA 9A 


obtaining any A-calculus by adding the rules corresponding to the specific modal 
axioms of A to the K-calculus. When proving soundness of NLs-calculi, we re- 
ally prove that for any NL A if there exists a sequent-style A-proof of a formula A 
(written, A is A-provable) then there exists an Hilbert-style A-proof of it (written, 
A is a A-theorem or -A A), so that (by soundness of usual Hilbert-style systems) 
A is true in any A-model (written, A is A-valid). As to completeness, we exhibit 
different proofs depending on the nature of sequents. 


As a standard line, first we develop our calculus for the minimal normal modal 
logic K, exhibiting the main ideas, and then we extend them to the other systems. 
In the Hilbert style (see [5]), the system K: 


1. is a modal system, i.e. a system closed under the rule of inference 


Ay Ans. An 
A 


RPL 


where A is a tautological consequence of A1, Ag,..., An 


2. is closed under the rule of necessitation 


3. contains the axiom schema 


K : O(A > B) a (DA > OB) 
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4. is closed under the rule 
Aj A...AAn >A 
“DA, A...AQA, > 0A 


5. contains the axiom schema 


Df0:0A 4 -D-A. 


Condition (5) depends on which language we adopt, while the other conditions 
are not mutually independent: so, for sequents with modal signs, we consider the 
conditions (1), (4), (5), whereas both for semantic modal sequents and for modal 
tree-sequents, we consider the conditions (1)-(3). 


2 SEQUENTS WITH MODAL SIGNS 


Sequent calculi for normal modal logics with separated modal and propositional 
behaviours are presented in [1], proving a cut-elimination theorem for the basic 
system K. 


In these sequents formulas can occur in a modal (possible or necessary) way; to 
denote this fact, we add to the usual modal language two metalinguistic symbols, 
namely ( ) and [], and we sign by such symbols formulas occurring in a modal way. 


Sequents range on sets of formulas either unsigned or signed by ( ) or [ ], and 
our calculus transforms the modal way formulas occur into modal operators. 


Signed formulas, e.g. (A), appear quite similar to the corresponding usual 
modal formulas, e.g. >A, but they are not really the same thing. In fact, (A) stresses 
the modal nature of the formula, and so modal rules may work on it, while QA 
stresses the propositional nature, and so usual propositional rules may work on it. 
So, from a semantic point of view (A) and QA ([A] and DA) are the same thing, 
while, from a procedural point of view they are different. In this sense the metalin- 
guistic signs ( ) and [ | have nothing in common with the semantic metalinguistic 
signs used in semantic tableaux (see [7]). 


Furthermore, we allow directly transforming (A) into OA, but we do not allow 
the vice versa; in such a way we introduce an ordering in applying rules (firstly the 
modal ones, then the propositional ones), without any lack in the expressive power 
(in fact, completeness also holds). 


The rules of the calculi are the usual ones, plus general rules to treat modalities, 
plus peculiar rules, each one for each modal axiom instead of each one for each 
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modal system, as usual (e.g. see [7, 13, 14]). Formally, let ( ), [] be two new met- 
alinguistic symbols: 


(A), [A] are signed formulas, where A is a formula of L; 
A, (A), [A] are expressions, where A is a formula of L; 
DI F A is a sequent where I’, A are (finite) sets of expressions. 


As to notation, we use capital latin letters for formulas of L, small greek letters 
for expressions and capital ones for sets of expressions. A signed formula is made 
of two parts: 


1. a formula of the language L, possibly with explicit modal operators; 


2. ametalinguistic sign whose role is to stress the way the formula occurs; this 
sign has no reference to explicit modal operators (if any) occurring in the 
formula. 


Metalinguistic symbols cannot be joined together by connectives to form other 
signed formulas. The expressions are formulas together with the indication of the 
way they occur: 


1. non-modal way 

2. ‘possibly’ way 

3. ‘necessarily’ way. 

In the first case expressions are really formulas of L, and we call them unsigned 
formulas, otherwise they are signed formulas. Actually, the expressions are the ob- 


jects of our calculus, so the sequents are made of expressions and rules apply to 
sequents of expressions. We divide the rules in: 


Logical rules regarding logical connectives 
Structural rules regarding sequents’ structure 

Duality rules regarding transformation of modalities 
Modal rules characteristic of each modal system 


Logical rules are the usual ones 5 F,F ~, A -,F A, V E,F V, >F, F> (see 
[16, 17]) and affect only unsigned formulas; so e.g. the rule V F (V: left in [17]) is: 


CFA DFA 
CVD,TEA 
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where C, D are formulas and I’, A are sets of expressions. The structural rules 
are: 


TFA TFA 
Thinning + ———— L Thinning ———— 
a HFA [Tt Ava 


where a is an expression and I’, A are sets of expressions, and: 


lD-A,D DFA’ 


Cut 
T, T'HA, A’ 


where D is a formula and L', I", A, A’ are sets of expressions. Thinning is like 
weakening in [17]; furthermore, we need neither contraction nor exchange rules 
because we use sets (as in [16]) instead of sequences (as in [17]). 


Duality rules convert metalinguistic signs into modal operators or transform 'pos- 
sibly' into ‘necessarily’ and vice versa. Such rules apply only to signed formulas: 


[A], FA CEA, [A] 
poo QQ 


DA, FA THA, DA 
(A), THA Lo PRAM) 
OA TEA l-A,0A 

TEA, [A] [A], CF A 

OOF ———— +oOo ———— 

(3A), PEA TEA, (3A) 
TEA, (A) (A), TRS 

oor ———— +60 -——— 
mA], TH A T F ô, [~A] 


where A is a formula and I, A are sets of expressions. 


The notions of proof and of end-sequent are defined as in a standard calculus of 
sequents [17]; an initial sequent is one of the kind A | A; a formula A is provable 
iff - A is an end-sequent. 


To develop a calculus of modal sequents for K we introduce the first modal rules, 
namely K -rules: 


ATA ,Q PRAA 
(A), [T] F (A) [T] - (A), [4] 
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where A is a formula, [ and A are sets of formulas, and, given a set of formulas 
$ = {Fo, ..., Fn}, [9] = [Fo, ..., F4] = [Fo], --- 5 [Fn], (9) = (Fo,...,Fn) ~ 
(F9),..., (Fn), [Ø] = (9) = Ø. Note that in K-rules the premises contain only 
unsigned formulas, while the conclusions contain only signed ones. By K-rules, 
we can equivalently assume a | a as initial sequents, where o is an expression. 


THEOREM 1 (Soundness of the K-calculus) Jf A is K-provable then A is a K- 
theorem. 


Proof. We translate sequents to formulas suitably adapting to the modal sequents 
case the Schiitte translation [15]: let * be the translation from expressions to for- 
mulas of L defined as: 


AX = A 
(A)* = OA 
[A] = OA 


where A is a formula; we extend that translation to sets of expressions, Y* = (vo, 
2, Val* = (V5... pm}, to sequents, (T F A)* = A(T*) — V(A*), and to 
sequent-style inferences into Hilbert-style inferences, 


. . * . . 
premise, premise, | (premise,)*, .. . , (premise,,)" 
conclusion (conclusion)* 


By x initial sequents are clearly translated to tautologies, and sequent-style infer- 
ences are translated to Hilbert-style inferences, so that the translation of a K- 
provable sequent is also Hilbert-style provable in K (see [1, Theorem 1]); so if A 
is a K-provable formula, i.e. if Ø + A is an end-sequent, then Fg (Ø + A)", i.e. 
(by the definition of +) Fx A, that is the thesis. 


To develop calculi for other NLs we introduce the modal rules T', 4,5, B, D. 
These rules are quite immediate translations of modal axioms into the formalism of 
the calculus of sequents (this correspondence is explicitly shown in the following 
Theorem 2). However, these rules respect the ‘introduction’ nature of the calculus, 
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and in fact involved expressions do not decrease in complexity. 


ATHA l-A,A 
[A], PF A TEA, (A) 
(AjTFA | PRAIA) 
(OA), PEA Tr A,[D4] 
[A], Pr A Tr A, (A) 
(DA), TH A THA, [0A] 
ATHA THA,A 


(DA,DFA CB TEA, [OA] 
(A), PRA ' TE A, [A] 
[A], TF A l- A, (A) 


where A is a formula of L and I', A are sets of expressions. a 


THEOREM 2 (Soundness of Normal-Calculi) Jf A is A-provable then it is a A- 
theorem, for any NL A. 


Proof. The proof is the same used for Theorem 1, with a slight modification; we 
must only prove that also the translations of rules T, 4,5, B, D are Hilbert-style 
inferences. That is an easy check; e.g., fort T 


1) (TF A,A) hypothesis 

2) AT*)—\V(A*)VA definition of + 

3 OAA--7A axiom T applied to ~A 
4 A-904A 3, PC, Df 

5) A(i*) 7 V(A*)V OA 2,4,PC 

6) (LFA, (A))* definition of *. 


THEOREM 3 (Completeness of NLs-calculi) [fA is a A-theorem then A is A- prov- 
able, for any NL A. 


Proof. We transform Hilbert-style A-proofs into sequent-style A-proofs. Fixed an 
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Hilbert style A-proof, for each formula proved at some step we define a degree O 
of dependence on modal inferences as: 


Q(A) =0 if either kpc A or A is a modal axiom 

O(A) =n if Bo, ..., B, Hpc where each Bj is proved at some 
previous step and max.{0(B;) }o<i<s =n 

O(A)=n+1 if BE, A by a direct use of rule RK, with O(B) = n. 


That is a good definition (fixed a proof), and the degree counts the maximal number 
of nested uses of rule RK in the proof. Moreover, we can always reduce the second 
case to the following alternatives: either each B; is a modal axiom or is proved by 
a direct use of RK, or (only if O(A) = 0) we can also reduce this case to kpc A. 


Recalling that (e) if T -pc A then the sequent T + A is PC-provable (see 
Kleene [11]), we transform each Hilbert-style A-proof into a sequent-style A-proof 


by an induction on the degree, by suitable uses of cut (for details see [1, Theorems 
3, 4]). E 


THEOREM 4 (Cut-elimination for K) /f a sequent is K-provable then it is K- 
provable without a cut. 


Proof. We add to the usual cut-elimination proof for PC (see [17]) the cases regard- 
ing duality and modal rules. As in [1], we allow cut on expressions (that is sound), 
so to apply the inductive hypothesis to the duality rules - 0,0 F, F- 0,0 F, too. 


The rank of a proof P, r(P), is defined as usual, while the grade of P, g( P), is, 
as usual, the grade of the principal formula of its last inference, g(a), that indicates 
the complexity of o with respect to the non-structural inferences, but we compute 
it as usual with a slight modification: in fact, in [17] the only non-structural rules 
are the logical ones, each one introducing a new (logical) symbol in the principal 
formula; so the grade of a is immediately determined only by counting the number 
of logical connectives occurring in œ. In our case, each non-structural rule intro- 
duces a new (logical or modal or metalinguistic) symbol, but O H,- O, Q F and 
F © also delete a metalinguistic one: so the global number of symbols does not 
increase; however, only such rules introduce modal operators, so that we solve the 
problem assigning to each modal operator a grade 2 (1 for itself 4-1 for the deleted 
metalinguistic symbol). So: 


g(P) = g(a) = - the number of logical connectives occurring in a, plus 
- the number of metalinguistic symbols occurring in o, plus 
- twice the number of modal operators occurring in a; 


for example, we have g(A) < g((A)) < g(94A). 
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The rest of the proof is a technical routine, whose details can be found in [1, 
Theorem 5]. a 


Takano, [19], noticed that for system K the duality rules OQ A,- C16, 
QUI F, QUI ‘are derived ones’; for example DO F is provable by: 


AHA 
=A, A,H 
TH A, [A] (74), [A] H 
(~A), TH A 


and ‘omission of these rules causes no difficulty in the cut-elimination proof for K’. 


Goré [10] noticed that any attempt to prove the S4-theorem QO(OP — OOP) 
without cut fails; the reader can construct a cut-free proof using the semantic modal 
sequents presented in the next section, and can see that such a proof requires using 
non-modal rules (e.g. weakening) between nested modal rules. That is not allowed 
by sequents with modal signes, since their structure is too simple, so that we must 
use cut. For a simpler example, let us consider how a cut-free proof of the K4- 
theorem OA > O(B — OA) should appear: 


CIIM 
[A] + [DA] 
[A] - [B > DA] 
DAF O(B > DA) 
- OA > O(B > DA) 


weakening + and -> into [. . .] 


So calculi of sequents with modal signs work uniformly well for the main 15 
NLs, but do not seem enjoy cut-elimination for systems other than K. In certain 
a sense, we have distinguished between two levels, an usual one, and a metalin- 
guistic level controlled by modal signs, making that distinction on formulas. But 
rules corresponding to the axioms that affect more than one nested modality (e.g. 
B) must use both modal signs and modal operators together (e.g. (L1A)), since do 
not have at their disposal more than one metalevel; furthermore, we cannot com- 
pletely manage formulas when contained into modal signs. So, as a natural evolu- 
tion, we extend those non-standard sequents adopting many levels organized in a 
tree-hierarchy, and making that distinction on sequences of formulas (actually on 
usual sequents). The calculi we develop offer a wider uniformity, solving the prob- 
lem of cut-elimination for the main NLs. 
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3 SEMANTIC MODAL SEQUENTS 


Cut-free calculi of sequents for normal modal logics are developed in [2] by us- 
ing semantic modal sequents, which are trees of usual sequents plus an accessibil- 
ity relation, and by introducing modal operators when moving formulas along the 
branches of such trees. 


We use only two general modal rules for all NLs O F- and F O (with a technical 
exception for systems containing the axiom schema D); we vary the first rule when 
varying the system depending on the accessibility, while we fix the second rule for 
all. We prove the completeness of our calculi for NLs, giving a semantic proof of 
cut-elimination, and obtain a uniform treatment that work well in every case. A 
semantic modal sequent is a triple (W, —, R) where W is a non-empty set of (oc- 
currences of) usual sequents called worlds (i.e. T - A, where I’, A are sequences 
of formulas of L), — is a strict tree-ordering on W and R is a binary relation on 
W that extends —, called accessibility. We often call ‘modal sequent’ a semantic 
modal sequent, and just only 'sequent' a usual sequent; furthermore, the locution 
‘occurrences of’ indicates that several instances of the same sequent can occur as 
different worlds in a modal sequent. 


The first two components of semantic modal sequents, W and — , give rise to 
trees of sequents, that we can also inductively define by: 


[tA  isatree of sequents, where D, A 
are sequences of formulas of L; 
r- A  isatree of sequents, where D, A 
J/...XV are sequences of formulas of L and Ao, ..., An(n > 0) 
Ao.--An are trees of sequents. 


We use capital latin letters for formulas, capital greek letters for sequences of 
formulas, small latin letters for worlds and small greek letters for trees. The other 
component of semantic modal sequents, the relation R, puts the corresponding as- 
pects of the semantic accessibility relation into sequents: namely, for a system hav- 
ing as modal axioms Az, ..., AZ, the accessibility R is the minimal relation con- 
taining >, R(Az,),..., R(Az,), where the correspondence between axioms and 
relations is given by the following table 
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R(Ax) 
— 
the reflexive closure of — 


the transitive closure of — 
the euclidean closure of — 
the symmetric closure of — 


In the case of the axiom D, since we cannot uniquely determine R as the serial 
closure of —, we consider a new rule, namely the empty rule; when proving com- 
pleteness, that rule (read upward) really allows us only to add unessential accessible 
worlds that maintain seriality in the countermodel under construction. 


The rules for propositional connectives are ^ F,F 7,A E,F A,V E,F v, 
—H,F- (see [17]) with suitable modifications for the use of trees. First, as to no- 
tation, two semantic modal sequents a = (W, —, R} and 8 = (W',—', R') are 
similar by (w, w’), where w € W and w' € W', (written asa w ~w f) iff they 
become the same by changing only w with w’, i.e. iff W — (w) = W' — (w'), and, 
given the bijection f between W and W” that is the identity except that f (w) = w’, 
we have both wo — w1 iff f(wo) —' f(wi) and wo Rw; iff f (wg) R' f (wi), for 


. In 
every wo,wı € W. Then, given a usual PC-rule 57—, where a',a”,b are usual 


. . . i ‘i 
sequents, the corresponding tree version of that rule is ==, where a’, o", 8 are 
B 9 9 


semantic modal sequents and a! a ~ar o" an ~b B y ~a a! (we can apply a 
similar reasoning to rules with only one premise). So, the tree version of a PC- 
rule leaves unaltered both the structures of the tree and of all the unaffected usual 
sequents. For example, now the rule —F appears as: 


^l ^l 
+ 4 
TFA,C D,TIE A 
JN ZN 
04 ...Ó4 61...0n 
^l 
l 
C= DT, IF A,A 
LN 
01 ...Ó04 
where y, 01,...,0n are trees of sequents (when n = 0 no ó; appears), T, A, IL, A 


are sequences of formulas, and C, D are formulas of L. The structural rules are the 
tree adaptation of the usual weakening +, -weakening, exchange H, + exchange, 
contraction H, F contraction (see [17]). 
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The modal rules are L1 | andt CJ, and move formulas along trees. Rule O H 
varies when varying modal system, depending on the accessibility relation R: 


OF from a modal squent a having two worlds T - A and A,II F A with 
(T - A)R(A,II + A) infer the modal sequent 8 obtained from a by sub- 
stituting (both in the domain and in the relations — and FR) those worlds 
with LA, T - A and II F A, respectively. 


Rule + O is the same for all modal system, since it depends on — instead of on R: 


+O from a modal sequent o having two worlds I | A and A, where 
+ Ais a terminal leaf and (T + A) — (F A), infer the modal sequent 
B obtained from o by substituting (both in the domain and in the relations 
— and R) the former word with T F A, DA and by completely deleting 
both the latter world from the domain and any occurrence of it from —, 
and consequently modifying R. 


For example, when R is the transitive closure of — those rules appear as : 


OF ^y "y FO 

+ + 

TFA TFA 

a ee or N 
O1...0, FA Opyt--.Om 
+ 

AIFA Y 

ZN + 

01...04 TFA,OA 

J|. 

^Y 01...0m 
+ 

DA,yFA 

Jeedl.VX 
l 

IFA 

J-N 

6, ...6n 


when n = 0 no 6; appears. Note that the sequent F A is a terminal leaf to do not 
introduce a disconnection in the tree of sequents. 
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Finally, for ‘serial’ systems, i.e. for those systems containing the axiom D, the 
empty rule holds: 


empty rule: from a modal sequent o having a world } as terminal leaf infer the 
modal sequent ĝ obtained from a by deleting that world (when not 
occurring as top node of a) from W and from both the relations R and 
=>: 


2 [re2 


The notions proof and end-sequent are used as in standard calculi of sequents 
(e.g. see [17]), an initial sequent is one of the form (W, +, R) where A- A € W 
for some formula A, an usual sequent’ - A is provable when (W, —, R) is an end- 
sequent with W = {T - A}, and a formula A is provable when | A is provable. 


THEOREM 5 (Soundness of the NLs-calculi) For any NL A, if A is A-provable 
then A is a A-theorem, and so it is \-valid. 


Proof. As usual, we translate semantic modal sequents into formulas by suitably 
modifying the Schiitte translation; namely, we define the translation * by an induc- 
tion on the complexity of trees (see [2, Theorem 1]): 


(I-A) = AT-9VA 
TFA \* 
J-N = ALTo(VAVvUASV...vVDA?) 
Ao... Àn 
where A is a formula of L,I and A are sequences of formulas, Ao,..., An are 


modal sequents and, by definition, A Ø = T and V Ø = L. Only the relation 
— influences the translation *, since there is no mention of R, which is indirectly 
reintroduced in this theorem by using modal axioms. We extend the translation to 
sequent-style inferences into Hilbert-style inferences as usual, proving that initial 
sequents are translated to theorems (see [2, theorem 4]), and that sequent-style in- 
ferences are translated to Hilbert-style inferences (see [2, Theorem 5]), so proving, 
as usual, that if a formula A is A-provable, i.e. if - A is an end-sequent, then A is 
a A-theorem. | a 
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Now we prove completeness and semantic cut-elimination together. A sequent 
proof appears as an inverted, at most binary, finite tree, whose root is the end-sequent 
(at the bottom), and whose terminal leaves are initial sequents (at the top). The way 
of moving along such reversed trees allows us to prove completeness: in fact, when 
moving downward we consider that tree as a ‘proof’, otherwise when moving up- 
ward we consider it as an attempt of constructing a ‘countermodel’; we really prove 
that when a usual sequent is not cut-free provable we can construct a countermodel 
of it (really of its Schiitte translation), so that it is not valid. 


THEOREM 6 (Completeness of the NLs-calculi) For any NL A, if A is A-valid 
then A is A-provable (without any use of cut). 


Proof. As usual, we prove the statement by contraposition: if A is not A-provable 
(without any use of cut—anyway we can rewrite it in the modal-sequent formalism) 
then A is not A-valid. We proceed by several steps, using a simplified version of 
the proof used in [8] for PC, adapted to modal systems (note we have only proposi- 
tional symbols, and transform Schiitte valuations directly into Kripke valuations). 
Namely, for any NL A: 


1. we fix aclass of functions on Kripke frames (the Schiitte valuations) that pre- 
serve the values true t and false f of formulas when going along a proof-tree 
as countermodel and when requiring every formula that appears on the left 
part of a sequent to be true, while any on the right to be false. Really, we 
define such functions by imposing conditions that must be respected when 
going along the tree as a proof, so that we can define them by an induction 
on the complexity of formulas (that increases downward); 


2. we prove that for any usual sequent I‘ - A that is not cut-free provable there 
is a Schütte valuation on a Kripke frame such that for some world the value 
of AT is t while the value of V A is f; 


3. we prove that any Schütte valuation can be reduced to a binary valuation (i.e. 
a valuation that can take only the values t and f not u) that preserves the val- 
ues t and f (so, there is a Kripke model where I | A is not valid, proving 
completeness); 


4. we introduce three-valued models, proving both that any Schütte valuation 
can be reduced to a three-valued model valuation that preserves the values t 
and f (so, there is a three-valued model where I - A is not valid) and that 
any three-valued model valuation can be reduced to a binary valuation that 
preserves the values t and f (so, proving completeness in another way). 


Let us examine each step (for more details see [2, Theorem 6]): 
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1. Given the set of well-founded formulas of the language, wff(L), a set of worlds 
W , a Strict tree-ordering on W, —, a relation R extending — and according 
with the properties of the accessibility for the system A, and a set of values 
(t, f, u}, we extend a Schütte valuation S : W x wff(L) — {t,f, u} (see 
[8, Definition 3.1.3]) to modal operators: for any w,w' € W, 


if exists w' s.t. wRw' and S(w', A) Zt then S(w, A) Zt 
if for every w' s.t. wRw', S(w', A) Zf then S(w, A) Z f. 


2. We define the domain W, the relations — and R, and the function S that fal- 
sifies I' - A on the basis of an infinite branch of an inductively constructed 
reversed tree of semantic modal sequents. Namely, we start from T - A, and 
at any level of depth, we examine every sequent of that level, giving rise to 
the sequents of the next level: since any sequent can construct at most two 
other sequents, at any level we have a finite number of sequents; for any of 
those sequents, we examine only one formula of only one world (if any) ter- 
minating the examination of that level in a finite number of steps. At the level 
n, for any sequent we need a list of its worlds, and for any world we need a 
list of its formulas: we examine the first formula of the first world in those 
lists, possibly we add new worlds or new formulas at the end of such lists 
(to not bypass the queue) and then we put the just examined world and for- 
mula at the end of the corresponding lists (cyclically rotating them, so that 
in a finite number of steps we can reach any formula of any world). 


For the sake of simplicity, an index n stresses only the level where a sequent 
Yn occurs, without specifying the branch (in fact we are really interested only 
in one infinite branch of that tree); list, and list, (v) denote the corresponding 
list of worlds and, for any world v, the list of formulas, respectively. 


Finally, since the worlds of a semantic modal sequent are really usual se- 
quents, with a left and a right part, and since those parts can change at any 
step (really, can only increase), we need two other lists, namely left, (v) and 
right, (v), the left and the right sequence of formulas of the world v at the step 
n, respectively. So, let yo = (Wo, 0, Ro) with Wo, = {vo}, left(vo) =I, 
right(vg) = A, —>o= Ø and let Ro be the suitable closure of —4 ; let also 
listo = vo (the listing of the worlds of Wo) and listo (vo) = I’, A (the listing 
of the formulas of vo). Given y, = (Wn, >n, Rn) with list, = vo,..., Us 
(never empty, since only increasing) and list; (vo) = A1, ..., Aj (possi- 
bly empty), let 4,41 (possibly y5,, and ^; ,,), list; 41 and list; 41(v) (for 
v € Wn+1) be defined in the way shown in [2, Theorem 6], that is an adap- 
tation of the way indicated in [8, Theorem 3.1.9]; here we show only modal 
Cases: 


i) if A; is an atomic formula and the system A contains the axiom D, then let 
Wai = Wa U {vs41}, —n4177a U{vo — Us+1}, Rn41 be the suit- 
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able closure of —n44 (it is easy to prove that it contains Rn), listn4; = 
U1,..., Us, Us+1, U0; lista 1 (ug) = Ag,... ; Ap, A, left(vs41) = J, right 
(vsti) = Ø and listn41(vs41) = Ø (we really add only a serial queue for 
vo); 


if A1 is DA, and it occurs in the left part of vo, and R(vo) = (v € W : 
vo R, ) Æ Ø, then let left, , ; (vo)-left, (vo), listn+1ı (v0) = A2,..., Ap, A1, 
left, ,1(v) = A, left, (v) and list, 4.1 (v)=list, (v), A for any v € R(vo); let 


also W441 = Wa,—441—7—92, Angi = Rn, listn41 = U1, . . . , Us, v0; 

if A; is DA, and it occurs in the left part of vg, R(vo) = Ø, and the system 
A does not contain the axiom D, then let 444 = Yn, but list, 41(v1) = 
A35, ... , Ap, Al, and list, 41 = U1,-.--,Us, V05; 

if A; is DA, and it occurs in the left part of vo, R(vo) = Ø, and the system A 


contains the axiom D, then let W,41 = Wn U {vs41}, 954174 U(vo > 
Us41], Rn+1 be the suitable closure of —>n+1 (it easy to prove that it contains 
Rn), list;41 = v1,..., Us, Us41, vo, leftn41(v0)=left, (vo), listn41 (vo) = 
A», ... , Ap, Al, left(vs41) =ø, right(vs+1) = @ and listn41(Us41) = Ø 
(we really add only a serial queue for vo); 


if A, is OA, and it occurs on the right part of vo , then let W,,1 = Wn U 
{Us41}, 941273 Utvo > vs+1 }, Rn41 be the suitable closure of >n+1 
(it contains Rn), listn+1 = ^ Up...,Us,Us41,00,. Tightr41(vo)= 
right, (vo), listn+1 (vo) = Ag,..., Ap, A, right(v,.4 1) — A, left(vs41) = 
Ø and listy41(vs41) = A. 


If Yn41 is (both 7;,,, and 47,4 are) cut-free provable, then yn is cut-free 
provable too; so, since we stop the construction of the tree on initial sequents, 
if the tree is finite then it is a cut-free proof of yo, against the hypothesis; thus, 
the tree must be an infinite, denumerable, (at most binary) tree of sequents; 
by König lemma, an infinite branch exists: let yo, ...,y;,..., (i < w) be 
the list of the sequents on that branch; let W = U{W; : W; e y, >= 
U(2:—€ yi}, R = U(R; : Ri € vi) (for non-serial systems) or let 
R be UR; : R; € yi} plus the reflexive closure of non-serial worlds (for 
serial systems); let also left(w) = U{left;(w) : i < w} and right(w) = 
J{right;(w) : i < w}, for any w € W. R satisfies the properties required 
for the accessibility relation of the system A. 


Now, let S be the valuation defined as S(w, A) = t iff A occurs in left(w), 
S(w, A) = f iff A occurs in right(w), S(w, A) = u otherwise. S is well- 
defined and is a Schütte valuation: for example, we prove by contraposition 
that S respects the last modal condition on Schütte valuations: let us assume 
S(w, DA) = f; by definition of S, LIA eright(w); so, by definition of W, 
from an index r on, DA must appear on the right of w; by construction of the 
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tree, when DA is to be examined after a finite number m of steps, A must 
appear on the right of some w' with w 4;4m w'; so S(w', A) = f with 
wRw’ (by the definitions of W, > and R). 


Finally, by construction, S(vp, AT) = t and S(vo, V A) = f. This com- 
pletes the step 2. 


. Let < be an ordering on (t, f, u} (in [8] was used the symbol >) defined as 
u <f,u <t,u < u,f < f,t < t; < induces on ordering on the Schütte 
valuations: 


S«T iff S(A) < T(A) for every A € wff (L); 


it is easy to see that any <-chain has an upper bound, so that for any Schütte 
valuation S there is a <-maximal valuation V such that S < V; but a max- 
imal valuation must be a binary one, and, by the conditions on Schütte val- 
uations, must be a Kripke valuation. Since < maintains the values t and f, 
V (vo, AT) = t and V (vo, V A) 2 f, so that V (vo, AT > V A) = f; as 
particular case, when the sequent is A we have V (vo, A) = f , proving 
completeness. | 


. A three-valued model is a triple (W, R, m) where W is a non-empty set, R 
is a binary relation on W, according with the properties of accessibility for 
any system A,m : Wx wff(L) —> {t,f,u} is a function defined by an 
induction on the complexity of formulas as usual (see [8, Definition 3.2.2]), 
plus the following modal cases: given w € W and R(w) = (w' : wRw'), 


if for every w' € R(w) m(w', A) =t . thenm(w,L1A) =t 
if exists w' € R(w) with m(w', A) =f then m(w, DA) =f 
if for every w' € R(w) m(w', A) Zf 

and exists w’ € R(w) m(w', A) =u then m(w, DA) = u. 


A three-valued valuation is really a Schütte valuation where the values of 
formulas are strictly determined by the values of their subformulas [8]: in 
fact, when both the value of A and the value of B ist then the value of AA B 
could be either t or u for a Schütte valuation, while must be only t for a three- 
valued valuation. 


To stress the link between three-valued valuations and Schütte valuations, we 
introduce a refinement of the ordering «1 , namely we define the relation < 
on the Schütte valuations as: 


S«T iff SaTandS|p- T |p 


it easy to see that any «&-chain has an upper bound, so that for any Schütte 
valuation S there is a «-maximal valuation m such that S « m, and that 
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such a maximal valuation must be a three-valued valuation. Since < (as «1) 
maintains the values t and f m(vo, AT) = t, m(vo, V A) = f. Further- 
more, we can restrict the relation < to the three-valued valuations: reasoning 
as above, for any three-valued valuation m there is a «1-maximal 
valuation V such that m < V; such a V must be a Kripke valuation and 
V(vo, AT — V A) = f; as a particular case, when the sequent is - A we 
have V (vo, A) = f, proving, in another way, completeness. 


Accessible worlds for modal logics were introduced by Kripke [12], where a tableau 
system based on auxiliary tableaux was presented to prove completeness for S5. 
The construction of countermodels in our completeness theorem for modal-sequent 
calculi gives an immediate technique for transforming our semantic modal-sequents 
into Kripke's tableaux, once we have accepted to allow formulas to be reused in a 
tableau, renouncing to discharge formulas or worlds. 


4 MODAL TREE-SEQUENTS 


Sequents with modal signs adopt a two-levels structure and syntactical rules; se- 
mantic modal sequents use a tree structure and semantical rules based on accessi- 
bility. By modal tree-sequents (see [4]) we try to combine the rich tree-structure 
of latter sequents with the syntactical approach of the former. Our goal is to prove 
that only reinforcing the structure of sequents sufficies to obtain calculi for NLs that 
work uniformly well, but instead of confirming in our opinion, modal tree-sequents 
open new questions. In fact, individuating rules that work well for NLs is quite 
easy (and allow us to link in natural way tree-sequents both with semantic modal 
sequents and with systems of natural deduction based on strict implication, pre- 
sented in [3]), while eliminating cut requires to mimic accessibility by syntactical 
rules, so requires extra rules both when considering single axioms, as for axiom 5, 
and when combining many axioms, as for axioms B with 4. 


Modal tree-sequents are trees of sequences of formulas with specific modal rules 
in correspondence to different modal axioms, which characterize modal operators 
by the way formulas move along trees. Tree-sequents adopt the tree-structure of 
semantic modal sequents (using trees of sequences instead of trees of sequents), 
but rules only have syntactical behaviours, since do not use accessibility. Without 
accessibility, a direct proof of completeness (in [8] style, like in [2]) is not easily 
practicable, so that we must translate Hilbert-style systems into our calculi. To do 
that we need cut rule, that eliminate in a syntactical way, adapting to tree-sequents 
the usual proof in Takeuti [17]. 
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We define tree-sequents by induction on complexity of trees: 


D is a sequent, where [ is a (possibly empty, e) 
sequence of formulas of L; 


do 
l'à : is a sequent, where I’ is a (possibly empty sequence of formulas 


On 
of L and ôo, . . . , ôn (n > 0) are sequents. 


We call trivial the former and non-trivial the latter. An old usual sequent T - A 
is now represented by the tree-sequent T { A, while the tree-structure of a semantic 
modal sequent (see [4]) like 


DEHA Ao 
To F Ao 1 ! js now represented by Tp 4. I, {Ai 
I? F A2 T2íAs 


As to notation, we use capital latin letters for formulas, capital greek letters for 
sequences of formulas, small greek letters for tree-sequents. Furthermore, 'e' rep- 
resents the ‘empty tree’ (i.e. a tree without nodes and branches), and, for sake of 
simplicity, we use a single tree-sequent to refer to the many branches departing 
from the same node: e.g. 


Bo Bo 


o(l(Bdenoteso(l' 4 : — , where ‘e{@’ actually represents è < : 
Bn Bn 


The structural and PC-connectives rules are ~ F,F ~m, A F,F A,V E,F v, 
—Lbi,F-, weakening F, weakening, exchange +, exchange, contraction H, H- 
contraction, with suitable modifications for the uses of trees. Namely, in the rules 
involving two premises, the two upper sequents must be equal in the structure of 
trees and in all the non-affected nodes; rules work only on non-trivial sequents, and 
right rules work on the terminal leaves, while left ones work on the other sequences. 
As to PC-rules, for example =>}, l- are: 


AF d m a{D.I} f F3 acr | Np 


p : p 
aC 5 Drm] AA arf A cup 


160 CLAUDIO CERRATO 


We only need an extra rule to manage degenerated trees; that presented in [4] is 


merge 
»" e 
B 
a{B 

axe 


Modal rules control moving formulas along trees; we have a right rule common 
to all NLs: 


FO 
a(e(c 


a{OC 


where a Æ e, anda left rule depending on the system, decomposed into many rules, 
one for each modal axiom: K: 


c 
«tl Grig 
fo 
«(c.n r{B 
where 8 Æ e. 
T: 
a{Cc,T{B 
a{OCc,T{B 
where 8 Æ e. 
4: 


o 
amf ne, T{2 
o 


ocni Zy 
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where 8 £ e. 
5: 
II(ó 
«| cire 
OC, II(ó 
"| T( 
where 8,ó Æ e. 
B: 
aiar - 
o 
str | Bong 
where 8 Z e. 
D: 
a(e(oe 
a[o 
where o # e. 


For modal rules, as for structural ones, we consider a formula as occurring in the 
right part of a sequent when it belongs to a terminal leaf, while we say it occurs in 
the left part of that sequent when it belongs to any other node. So, in the rule - C) 
the affected formula remains on the right, while in the rules K,T, 4,5, B, D the 
affected formula remains on the left: all modal rules do not change the side where 
the affected formulas occur, but move them towards the top of the tree, like putting 
them on an upper metalinguistic level. The notions of proof and of end-sequent are 
defined as in standard calculi of sequents (see [17]); an initial sequent is one of the 
kind 


ed ALA 


where es denote generic trees made only of empty sequences 2; a formula A of L 
is provable iff @{A is an end-sequent. 
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Weakening, exchange and contraction of branches are derived rules (see [4]): 


tree-weakening tree-exchange tree- contraction 


ai Ib a} 5 
5 B 

a} 5 a{B af § 

TET axe axe. 


THEOREM 7 (Soundness of NLs-calculus) /f A is A-provable then A is a A- 
theorem, for any NL A. 


Proof. We define the Schütte translation * by an induction on the complexity of 
non-trivial tree-sequents (using also a companion function F, that also works on 
trivial tree-sequents) as follows: 


(A)? = VA 
dy 

r4: = AT => (ôF v... v ôH) 
Ôn 
so bo ^ 

y4 : = O1y¢: = AT => (ôf v... v ôH) 
Ôn On 

where I’ and A are sequences of formulas, ĝo, . . . , Ôn are tree-sequents. 


So, when translating a sequent, the first implication is material, as usual, while 
the other ones are strict (resembling that done for natural deduction based on strict 
implication for NLs in [3]); in this sense any level of depth along branches can be 
viewed as a metalevel w.r.t. the deeper ones; in particular, the first level is like a 
metalevel w.r.t. all the others. As usual, we extend the translation * to sequent-style 
inferences into Hilbert-style inferences, and prove, in a way similar to semantic 
modal sequents, that initial sequents are translated to tautologies, and sequent-style 
inferences are translated to Hilbert-style inferences, so to immediately obtain the 
thesis (for a detailed proof see [4]). NH 
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THEOREM 8 (Completeness) /f A is a A-theorem then A is A- provable, for any 
NL A. 


Proof. As standard proof, we show that modal axioms are provable and that our 
calculus is closed under rules RPL and Necessitation. 


1. If A is PC-deducible from A;,...,An then the sequent A;,...,A, F A 
is PC-provable (see [11]), i.e. since the PC-fragment of any A- calculus is 
really a PC-calculus, A;,...,An{A is PC-provable; so if Ai,..., An are 
A- provable (e.g. &(41,..., 9 (A, are end-sequents) by n uses of cut A is 
A- provable, too; i.e. the rule RPL. 


2. Any modal axiom among K,7',5,4, B, D is provable in a calculus contain- 
ing rules K,T,5, 4, B, D, respectively; for example: 


K: 
hypothesis 
B{A{A @S{B{B 
>F 
g(A — B, A(B 
—————————————— OF twice 
DA,D(A > B){o{B -n 
OA,O(A > B){OB 
— l-— twice 
{O(A > B) 2 (DA > OB) 
4: 
hypothesis 
ø{0A{0A 
DA{s{OA 
OA{OOA 
g(DA > ODA 


3. Our calculus is closed under Necessitation, i.e. if A is A-provable then ODA 
is A-provable, too. In fact, we can easily modify all of the proof of À (i.e. of 
the sequent 2 ( A) by substituting any sequent o (especially the initial ones) 
with 2 (o, so obtaining a proof of @{@{A; by rule - O we immediately 
have @{DA, i.e. a proof of DA. 
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For cut-elimination the question is unexpectedly complicated: in fact we can 
easily adapt the proof in [17], once we have proved that a left formula can be moved 
along a tree according with the corresponding accessibility relation (see [4]). But 
e.g. rule 5 allows us to transport a formula from a sequence to its brother, but not 
upward or downward the tree, and combining rules B with 4 (and with usual O F) 
we can move a formula as upwards as we like, but only one step downward. 


For example, we cannot prove the K5-theorem QÀ — 00104, which requires 
a formula A moving to the son of the brother when becoming DA (or equivalently 
requires QA moving downward); a cut-free proof should appear as: 


AA{AA 
o l MM 


gi3A 
2 { IO Ao 


{nA 
af d 


not allowed by the only 5 rule 


b= 


So we must allow a formula C to move according with euclidean accessibility 
(when becoming OC): in [4] that movement is splitted in four distinct rules: O F, 
which transforms C into OC moving of one step upward, 51, which moves OC to 
the father (when is not a top- node), 52, which moves OC to the brother (when is 
not a top-node), 53, which moves OC to the son: 


51 02 93 
{A A 
amf OC, r {ó af DC,T(8 afond ris 
A OC, IA A 
a{oc,nf r{6 af r{B amf DO, T (ó 
a, 0, A Ze a,d,A Ze ó,AZe 


Rule 53 just sufficies to prove completeness (in fact, rules O H and 53 imme- 
diately give rule 5), but all of them needs to eliminate cut (e.g. proving O(0A — 
ODA) requires rule 51). 


That opens a question: are not trees (of sequences or of sequents) the right struc- 
ture for syntactical sequent calculi for NLs, or can only usual modalities reach a 
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portion of the expressive power offered by trees (so requiring the use e.g. of tense 
operators to completely managing trees)? To be continued. 


5 CONCLUSIONS AND ACKNOWLEDGEMENTS 


Non-standard calculi for NLs show hidden aspects of deductive systems: sequents 
with modal signs stress the different role of modal and propositional behaviours in 
NLs proofs; that difference assumes an algorithmic value. 


Semantic modal sequents combine sequents with Kripke models, directly intro- 
ducing the semantic feature of accessibility into the calculus; the consequence is a 
uniform treatment for NLs which works well, is cut-free, and offers a natural com- 
pleteness theorem. 


Modal tree-sequents combine a tree-structure of levels with syntactical rules, so 
representing a link between the other two formulations; those calculi work well and 
are cut-free, but the rules necessary for eliminating cut are more than we need to 
prove completeness, opening an unexpected gap. 


Finally, I would like to acknowledge my gratitude to Prof. M. Fattorosi-Barnaba 
for the many helpful conversations I had with him about the topic of the present 
work. 


via di Bravetta, Italy. 
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KOSTA DOSEN AND ZORAN PETRIC 


MODAL FUNCTIONAL COMPLETENESS 


INTRODUCTION 


This paper is a companion to [3], where it was shown how a modal version of the 
deduction theorem induces modal extensions of substructural logics. Here we shall 
prove some of the results announced in the concluding section of [3]. Namely, we 
shall prove modal functional completeness for categories corresponding to the main 
modal substructural propositional logics. This modal functional completeness is 
related to the modal version of the deduction theorem of [3] as functional complete- 
ness for bicartesian closed categories 1s related to the deduction theorem for intu- 
itionistic propositional logic. Roughly speaking, the deduction theorem says that 
the system is strong enough to express its own deductive metatheory. Functional 
completeness says that the deductive metatheory can be embedded in the system. 


Functional completeness for categories is a property of the same kind as com- 
binatorial completeness for systems of combinators. Combinatorial completeness 
permits us to define functional abstraction and demonstrate the equivalence with 
systems of lambda terms. Functional completeness permits us similarly to find sys- 
tems of typed lambda terms as internal languages of closed categories. A well- 
known use of typed lambda terms in proof theory, which goes by the name of the 
Curry—Howard correspondence, is to serve as codes of natural-deduction proofs. 
If categories are conceived as logical systems with an equivalence relation among 
proofs, which is induced by normalization, this coding and extracting the internal 
language boil down to the same thing. 


We learned about these matters from [6] and refer to that book for a demonstra- 
tion of the importance of functional completeness. We shall try to follow the style 
of [6], so that our results may be compared with the standard functional complete- 
ness results proved there for cartesian, cartesian closed and bicartesian closed cat- 
egories. An acquaintance with [6] may also help to grasp the proof-theoretical im- 
port of what we will present here—a matter about which we don’t have space to 
say much. Further logical motivation may be found in the aforementioned paper 
[3] and in [4]. We refer to these works only for motivation. Otherwise, our paper 
will be rather self-contained. 
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For our functional completeness results we concentrate on categories corres- 
ponding to what we take to be the minimal substructural logic: namely, Lambek's 
nonassociative calculus. The modal postulates assumed for these categories are 
those of S4 plus modalized versions of the missing structural rules—the same kind 
of modal postulates one finds in usual presentations of linear logic. The modal 
functional completeness theorem proved for these categories can easily be extended 
to categories corresponding to better-known substructural logics, like linear and 
relevant logic, as we indicate in Section 6. Working with the minimal substructural 
logic has technical advantage and should help dispelling the impression that this 
sort of theorem, as well as some others, is exclusively tied to a particular substruc- 
tural logic. Results about linear logic are often presented in a fashion that fosters 
such misleading impressions. For example, modal translation results (which are 
not foreign to the matter we are treating) are presented as if they had to do with lin- 
ear logic specifically, though analogous results may be obtained as easily for other 
substructural logics. 


We shall devote a section to a transformation of our proof of modal functional 
completeness into a proof of ordinary, nonmodal, functional completeness for 
cartesian, cartesian closed and bicartesian closed categories axiomatized in a non- 
standard manner. Namely, instead of having as primitives projection arrows and 
a pairing operation on arrows, we have arrows corresponding to structural rules, 
and we show how with arrows corresponding to the structural rules of thinning and 
contraction, Mac Lane’s axiomatization of symmetric monoidal categories, which 
among other things has arrows corresponding to association and permutation, can 
be extended to an axiomatization of cartesian categories. This means putting carte- 
sian categories in a substructural perspective, where we envisage rejecting struc- 
tural rules. 


We produce such nonmodal substructural categories when in a separate section, 
at the end, we consider restricted nonmodal functional completeness for them. The 
restrictions in question are obtained by having special notions of what in the ter- 
minology of [6] is called polynomials in polynomial categories. This is related to 
having restricted classes of typed lambda terms as codes of proofs in substructural 
logics. We leave for the future a more thorough investigation of this and some other 
matters mentioned in the concluding section. 


We also devote a section to justifying the assumptions made for our categories, 
and, in particular, assumptions related to structural rules. We shall show for a num- 
ber of them, and in particular the more abstruse, that they are not only sufficient, but 
also necessary for functional completeness. Most of these assumptions are quite 
well known and can be found in Mac Lane’s book [7]. There they are motivated 
differently: they appear in connection with coherence problems (cf. the very end of 
our paper). We motivate the same assumptions by functional completeness. Though 
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coherence may also be related to logic, we believe the sort of motivation we provide 
does more justice to the logical character of our categories. Anyway, it should be 
better than just taking over these assumptions from category theory, as it is some- 
times done when categories are presented as ‘models’ for logics. 


However, we didn’t find in the literature all of these assumptions: some of those 
tied to the structural rule of contraction may be new. Among them we single out 
something we shall call octagonal equation, a principle we shall use quite often, 
which is comparable to Mac Lane’s pentagonal and hexagonal diagrams of natural 
associativity and commutativity. 


To end this introduction, let us mention a notational and terminological matter. 
Though we shall in general imitate the style of [6], we shall diverge somewhat in 
notation. In particular, we write arrows in categories with a turnstile, as f : Al B, 
instead of f : A — B. This we do because we reserve — for implication, an oper- 
ation on objects, for which category theory uses an exponential notation or square 
brackets. We prefer to stay close to logical notation because our motivation is in 
logic. We want to suggest that objects in categories correspond to formulae, op- 
erations on objects to connectives, arrows to sequents, special arrows to axioms, 
operations on arrows to rules of inference, and equations between arrows to con- 
versions of proofs, like those we make in normalization. However, we dont want 
to diverge from category theory also in terminology. So we call f : A F Ban 
arrow, rather than a sequent. This hybrid between logical notation and categorial 
terminology may be slightly awkward, both from the point of view of logic and of 
category theory, but it just reflects the nature of our work. We use categories to say 
something about logical systems. 


Because of its connection with the deduction theorem and proof theory, func- 
tional completeness could perhaps be called deductive completeness. But, to follow 
the policy exposed in the previous paragraph, and to prevent misunderstanding, we 
prefer to stick to the established terminology. 


1 NL CATEGORIES 


A graph consists of a class of arrows and a class of objects, together with the func- 
tions that to every arrow assign the objects that are its source and target. We use 
f,9,h,..., possibly with indices, for arrows, and A, B, C,..., possibly with in- 
dices, for objects. We write f : A - B to say that A is the source of f and B the 
target of f. For such an f we say that it is of type AF B. 


A deductive system is a graph in which we have a special arrow for every object 
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14: AF A 
and the binary (partial) operation of composition of arrows: 


f:AF- B g: BEC 
gf: AF C 


A category is a deductive system in which the following equations between ar- 
rows are satisfied: 


(catl) For f : AF B, Isf=f, fla=f. 


(cat2) Forf: AF B,g: B- Cand h : C - D, h(gf) = (hg)f. 


An NL deductive system (‘NL’ stands for ‘nonassociative Lambek’) has the fol- 
lowing in addition to what every deductive system must have: 


binary operations on objects: €,—,«—-,^,V 
special objects: 1,T,1 


special arrows for every object A and B: 


ca:Ie AFA ĝa: AeI- A 
a’, :AFIeA 05 : AF AeI 
exp: Ae(A— B)F- B € pg: (Bt A)eAF- B 
TAB: ANBEA T, B: A^NBF- B 
TA: AF T 
Kap: AF AV B Kap: BFAVB 
ta: LEA 


‘a’ is to be associated with ‘sinister’, ‘ô’ with ‘dexter’, ‘r’ with ‘terminal’, ‘e? with 
‘initial’, and the superscript ‘2’ stands for ‘inverse’ or, maybe, ‘introduction’; the 
remaining notation for arrows and operations on them is modelled after [6]) 
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operations on arrows: 


f:AF B g:CF- D 


feg:AeCFrBeD 
f:AeCkKB f:CeAtFB 


*"f:CF AA B f':CF-B—A 
f:CF-A g:CF B 


(9 :CFAAB 


f: AFC g:BF C 
[fg]: Av BF C 


The minimal NL deductive system is an extension of Lambek's nonassociative 
calculus with the propositional constant I, the lattice connectives ^, V, T and L 
(lattice connectives are called ‘additive’ in the terminology of linear 
logic), and the associated arrows and operations on arrows. Note that the exten- 
sion with I and the c, 0’, 6 and ôt arrows amounts to adding some structural rules; 
namely, rules for dealing with the empty collection of premises. Hence, it 
shouldn’t be surprising that this extension is not conservative (it enables us to prove, 
for example, (A + A) — B F B). We shall call the ø, ot, ô and ó* arrows oô ar- 
rows. 


An NL category is an NL deductive system that is a category in which the fol- 
lowing equations between arrows are satisfied: 


e equations 


(e) For f; : Ay A- By, gi : By + Cy, fo : Ao F- Bo and go : Bo F 
Co, (91 © g2)(fi © f2) = (nfi) © (92 f2). 


(e1) lAelp-—lA.B 
có equations 


(o) Forf:AF-B, foA-—opgp(lie f). 
(ô) Fo f:A- B, fda = dp(f *1ri). 


(co*) oach =1a, 040A = liea 
(65°) 646% = 1A, 0504 — lad 
(có) of =O 


closure equations 
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(28) Forf:AeCFB, eXp(lae*f) =f. 

(29) Forg:CFA>B, “(eX p(laeg)) =g. 

(= 8) Forf:CeAFB, eG p(ftela) =f. 

(=n) Forg:CFBtA, (eas pg(gelA)) =g. 

bicartesian equations 

(AB) Forf:CF Aandg:CF B,nAB(f.g) =f, maplf.g) =9. 
(An)  Foh:CF AAB, (rA ph,m, gh) — h. 

(T) Forf:AF T, ta=f. 

(VB) Forf:AFCandg: BFC [f,glkap=f, [f,g]K'4 B = 9. 
(Vn) | Forh:AVBFEC, [hka B, hw, g] — h. 

(L) For f: LFA, :iA-f. 


With the help of either (— 8) or (< 8) we can derive (e1): we have 


lAelp = EX AeB(LA e*14ep)(14 * 1p), with (cat 1) and (> 8) 
= la.2, with (e), (cat 1) and (> f) 


and can proceed analogously with (+ 3). However, we have preferred to include 
(e1) among the e equations so that we may have it even in the absence of — and 
«— (see Section 4). 


With the definitions 


Def —arlpef 
foD =a felp 


we can derive the following equations by using (cat 1) and (e): 


(e2) De(gf)—-(Deg)Def), (gf)eD —(geD)(f e D) 
(ebifunctor) (fi e B3)(Ai e fo) = (Bı e fX fa e Ag) 


Conversely, if the unary operations on arrows D e . and . e D are primitive instead 
of the binary operation on arrows . e ., we can define the latter by either of the 
following two definitions 


fie fo =at (fi € B3) (Ai © f2) 
fie fo =at (B1 © f2)(f1 © A2) 
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and derive (e) by using (e2) and (ebifunctor). With these unary operations primi- 
tive, (e1)is replaced by 


Delp = 1DeB, 14° D = 14D. 


When it is more convenient to work with D e _ and . e D instead of _ è _, we may 
freely avail ourselves of this opportunity. 


The first six có equations assert that o and 6 are natural isomorphisms. They 
can be replaced by the four equations 


(18) op(lief)o,=f, da(f 1094 = f 
(In) opfoa=lief, dsfoa=fel 


which are more parallel with the closure and bicartesian equations. The letters ‘3’ 
and ‘7’ in the names of all these equations should point towards the analogy with 
f and 7 conversion in the lambda calculus, which are themselves analogous to the 
conversions of two types of detours in natural deduction: introduction followed by 
elimination and elimination followed by introduction. However, the có equations 
as we have given them are more transparent. These equations are assumed by Mac 
Lane [7, VII.1] for monoidal categories. The difference is that for e in NL cate- 
gories we need not have a natural associativity isomorphism. 


For an NL category C, the operations . e . on objects and arrows determine a 
functor from C x C to C, i.e. a bifunctor, whereas D e . and . e D are functors from 
C to C. With the definitions 


D > f =at *(fED A) 
f — D =a (feba) 


we obtain the functors D — -and - — D fromC to C. We also have the definitions 


€p(A) =a ED, | €p(A) =af ED,A 
Np (A) =af *lpea H(A) =af lAeD* 


(conversely, we can define * f as (D — f)np (A), and f* as (f + D)nō (A)). 


The e and closure equations amount to asserting that -è . is a bifunctor, that for 
every object D the functors D e _ and D — . are adjoined, the first being left ad- 
joined and the second right adjoined, with the natural transformations €p and np 
being the counit and unit of the adjunction, and analogously with the functors -e D 
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and _ —— D and the natural transformations € and np . All this may be expressed 
by saying that NL categories are biclosed. The bicartesian equations amount to as- 
serting that NL categories are bicartesian with respect to the operations ^ and V, 
their terminal object being T and their initial object 1. 


The equations (— 7) and (+ n) can be replaced by the equations 


(*) Forf:AeDt Bandg:CF- D, *(f(14eg)) = (*f)g. 
For f:DeAt Bandg:CtD, (f(ge1,))* —(f*)g. 


(*&) "64. B = lass, €AB = lBc-A. 


The style of these equations is comparable to the style of the equation (c) and 
the second (aa*) equation (or (ô) and the second (56*) equation): the equations (*) 
say how the operations * permute with composition, whereas the equations (+e) 
exhibit the result of eliminating and then reintroducing an implication. Similarly, 
the equations (^An) and (V7) can be replaced by the equations: 


(fh,gh) =(f,g)h, (TA,B; TA B) = lars 
[Af hg] = Alf, g], [KA,B, K^ B] = lAvB- 


The equations ( T) and (.L) can be replaced by the equations: 


Forf: AF B, ta =Tef, Tr — lr. 
For f : BEA, LA = ftB, by =l. 


2 NLU CATEGORIES 


To define the modal NLO deductive systems and NLD categories we need the fol- 
lowing piece of terminology. First we define inductively the factors of an object 
in a deductive system that has the binary operation e on objects: A is a factor of A; 
if B e C is a factor of A, then B and C are factors of A. An atomic factor of A is 
a factor of A that has no factors save itself. Let us consider deductive systems that 
have the binary operation e and a unary operation [] on objects, and also the special 
object I. We shall say that an object OA in such a deductive system is boxed. An 
object is modalized if and only if each of its atomic factors is either boxed or else 
it is I. 


An NLO deductive system is an NL deductive system that, moreover, has the 
following: 
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unary operation on objects: O 
special arrows: 


rA : LIAFE A, for every object A, 


bypc:Ae(BeC)t (Ae B) eC, provided A or B or C is modalized, 
ba pc: (AeB)eChAe(BeC), provided A or B or C is modalized, 


cap: Áe B- Be A, provided A or B is modalized. 
kA: AFI, provided A is modalized, 
WA:AF AeA, provided A is modalized, 


operation on arrows: 


f: BF A 


= provided B is modalized. 
f :BF UA 


Note immediately that for modalized objects A we have the arrows 19 
A F DA. These and the r arrows would enable us to formulate the provisos for 
the b°”, b,c, k and w arrows by restricting ourselves to boxed objects A, B or 
C', rather than any modalized objects. However, for technical reasons, it is more 
convenient to have the provisos in the equivalent form above. 


The b~, b*,c,k and w arrows are related to the combinators usually named 
with the corresponding capital letters. A b^* arrow is used to obtain the arrow 


“(Eq BECA e 14^ B)bc. c. AA B) : A> BF (C A) > (C > B) 


which corresponds to the functional type of the combinator B taking arguments on 
the left-hand side, whereas b*~ is used for 


(€4.B(1B—A ecc A)PBcAA-C,O) : B —— AF (B + C) — (A c— C) 


which corresponds to the functional type of B taking arguments on the right-hand 
side. This explains the upper indices of b~ and b^. The b” and b“ arrows will 
be called b arrows. 


The b, c, k and w arrows are related to structural rules, too: b arrows to asso- 
ciation, c arrows to permutation, k arrows to thinning and w arrows to contraction 
(to give the full force of thinning, k arrows have to cooperate with có arrows), So 
we shall call these arrows structural arrows. We have said in the previous section 
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that the có arrows may be taken as structural. Such are also the arrows 14, which 
correspond to the combinator I. However, when we say here structural arrows, we 
mean the b, c, k and w arrows. 


An NLO deductive system has modalized forms of the structural rules missing 
from NL deductive systems; moreover, it has S4 modal postulates. We can replace 
the operation on arrows © by special arrows of types 


DO(A > B) H DA > OB 
DA - ODA 


and the restricted form of P where B is I; this form of 7 corresponds to the modal 
rule of necessitation. An alternative is to replace Ë by special arrows of types 


L1A e C1B F DI(A e B) 
IF Ol 


and the restricted form of H where B is boxed (cf. the derivation of in Section 
5). Still another alternative is to replace by special arrows of the last two types 
plus OA F OGA and the operation on arrows 


f: AF B 
Df:DAFUB 


An NLO category is an NLO deductive system that is an NL category in which 
the following equations between arrows are satisfied: 


O equations: 


(OB) Forf:BEA, rAf9-f. 
(Qn) Forf:BEOA, (rAf)H- f. 


b equations: 


(b Forf:AtD,g:BtEandh:CtF ((feg)eh)bzpso = 
bp g r(f * (g e h)). 


(bb) bzZ5cb4so-l(.m.c, bAscbABc-Aa(Bec) 


(cób) (ôa elg)bz;p =1laeos 
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(b5) bZscpbZ B.cep = (PAB, ° 1D) PA Bec p(14 ® bE cD) 
c equations: 


(c) For f: AF Candg: BF D, (ge f)cA,B = cc,p(f * 9). 
(cc) — cB,ACA,B = 1AcB 
(cóc) o4acar = ÓA 


(bc6) bc, gcA4eB,c bz B c = (Cac ela) bao p(la *cn,c) 
k equations: 


(k) | Forf: AF B, kA — kpf. 


Qk) k=] 
w equations: 


(w) Forf:AF-B, (fef)wa=wef. 
(ców) owi = li 

(bw) by 4 (la wa)wa = (WAela)wA 
(cw) CA,AWA = WA 


(bew8) If c"? 5 op =at ba cp. p(1Ae (bo p,p(es,c*lp)bg c, p)) PA, B,cep: 
CA B,A,BWAeB = WAe*WB. 


(ckw) 0 A(kA e 14)wa = lA (ókw) da(la e kA)WA = lA 
Of course, the arrows f in (O£) and (On) must have B modalized, and, like- 


wise, the other equations involve arrows with provisos for modalized objects. The 
equation (On) can be replaced by the two equations 


(B) (fg) = fog 


(Gr) rd =p, 
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which is quite parallel to replacing (—> 7) and (+ 7) by (*) and (*e); the r arrows 
are analogous to the € arrows, and the operation on arrows - is analogous to *. For 
the sake of example, let us derive (P): 


fg= fg 

ra(fg)} = r4fPg, with (D8) 
(ra(fg)9)9 = (r4f8g)H 
(fg)? = fHg, with (On). 


It follows immediately from (C18), (P) and (Ur) that for modalized A the arrow 
19 : AF DA is an isomorphism, its inverse being r4. 


If for f : A - B we define Of as (frA)U and t4 as 1B, then the functor D_, 
the r arrows and the t arrows make a comonad (or cotriple). It is easy to check that 
the r and t arrows are natural transformations and that we have equations corre- 
sponding to the three commutative diagrams of [7, VI.1, p. 135] (r corresponds to 
Mac Lane's e, and t to Mac Lane's ô). However, the r and t arrows and the opera- 
tion on arrows D- don't suffice to define our operation on arrows .P. As we have 
already noted, we need moreover arrows of types DA e OB t+ O(AeB) and I - OI 
(which in the axiomatization of [8, 9.7, pp. 87— 90] must be recuperated in a round- 
about way, via isomorphisms between DA è OB and O(A ^ B), and between I and 
OT; in the absence of k and w arrows, which are recuperated similarly, such an 
axiomatization becomes impracticable). 


The equations (b) and (bb) can be replaced by the two equations 


bp g,F(f e(geh))ba so —(feg)eh 
bp er ((f eg)e h)b4 Bc = fe(geh) 


(analogous to (I8) and(In) from Section 1), but (b) and (bb) make it more trans- 
parent we are dealing with a natural isomorphism. Similarly, (c) and (cc) can be 
replaced by 


cp,c(gef)cApB — f e9 


but, again, (c) and (cc) make it more transparent we are dealing with a natural iso- 
morphism. Of course, from (b) and (bb) we obtain immediately 


(f © (g è h))b4 ac = bp,g,r((f£ eg) eh) 


and this equation could replace (b). 
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The equation (b) can be replaced by the three equations 


(b) ((feB)eC)bZ go = bp,ac(f *«(BeC)) 
(b) ((Aeg)*eC)bZ ao 7 bz gc(A*(geC)) 
(bs) ((A e B)eh)bz p.c — bz p, r(4e (B e h)) 


where D e f is 1p e f andf e Dis f è 1p, as we have defined them in Section 1. 
Similarly, (c) can be replaced by either of the two equations 


(cı) (Bef)cA,B = cc,B(f ° B) 
(c2) (g e A)ca,B = cA,D(À e 9). 


It is natural to assume these substitute equations if the unary operations on arrows 
D e . and -e D are primitive instead of the binary operation on arrows . e ., though 
(b) and (c) would do as well. The equation (w) then reads 


(feB)(Ae f)wA = wef. 
From (o6b), which amounts to 
b4LB = ó* ecp 
we can derive (without using c arrows) the following two analogous equations: 


(cb) (CA e 1p)br, 5 = OAeB 
(db) OAeBby’ BI = lA e OB 


(7, VII.1, p. 161 ; Exercise 1]. We call these three equations, and those derived from 
them with (cc*), (88°) and (bb), triangular equations, because Mac Lane assumes 
(cób) for monoidal categories as a triangular commutative diagram [7, VII.1, p. 
159]. 


The equation (b5) is Mac Lane's pentagonal diagram for monoidal categories 
[7, VII.1, p. 158]. We call this equation, and equations derived from it with (bb), 
pentagonal equations. 


The equation (cóc), assumed by Mac Lane for symmetric monoidal categories 
(7, VII.7, p. 180], enables us to define the ô and ôt arrows in terms of the c and ot 
arrows, or the other way round. With this equation, it is superfluous to assume in 
Section 1 the equations (6) and (66*), or (c) and (a0*). From (cóc) and (oc) it 


180 KOSTA DOSEN AND ZORAN PETRIC 


follows that c4 ; = 04,6,, which with (có) and (co*) yields cir = lier. This last 
equation with (cóc) immediately yields (o6). 


The equation (bc6) is Mac Lane's hexagonal diagram for symmetric monoidal 
categories [7, VII.7, p. 180]. It says, intuitively, that permutation of products like 
Ae B incA, p,c can be replaced by permutation of the factors A and B in c A,c and 
cp,c. We call (bc6), and equations derived from it with (bb) and (cc), hexagonal 
equations. 


If we forget about the provisos for modalized objects, the b equations together 
with the e and có equations of Section 1 axiomatize monoidal categories. If to 
that we add the c equations, we obtain symmetric monoidal categories. The clo- 
sure equations of Section 1 transform the former into monoidal biclosed categories 
and the latter into symmetric monoidal closed categories (cf. Section 6). 


The k equations can be replaced by the single equation: 
Forf: AFI, kAc- f. 


At the end of Section 1 we have noted something quite analogous concerning the 
equation ( T). The k equations say that I is a terminal object when we restrict our- 
selves to arrows from modalized objects. Formulating these equations as we did 
makes clearer the parallelism with the b, c and w equations. The equation (k) cor- 
responds to (b), (c) and (w): when written 1rka = kg f, it says that k is a natural 
transformation from the identity functor to the constant functor that maps objects 
into I and arrows into 1;. Of course, k need not be an isomorphism as b and c are. 
The equation (1k) should be compared with 


bzrp = 5% © op and bj; ; = oj eor, which are related to (odb) and (oô), 
CAI = 056A and cj; = lrer, which are related to (cóc) and (oô), 


Wy = ot and w; = 6}, which are related to (ców) and (o6), 


but it may also be compared with (bb) and (cc). 


The equation (w) says that w is a natural transformation from the identity func- 
tor to the square functor that maps objects A into A e A and arrows f into f e f. 
However, w need not be an isomorphism. 


The equation (ców) has ‘ô’ in its name because through (oô) it involves ôr. 
The equations (ców), (bw), (cw), (akw) and (kw) say, intuitively, how ø, ô, b, 
c and k arrows behave if they are composed with w arrows. Though c; ; equals 1;,;, 
the arrow c4,A need not be equal to 14,4; but it behaves like 14,4 if composed 
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with w4. (We can derive cj; = 11.1 from (ew), (ców) and (co*), which is dif- 
ferent from the derivation we gave above.) Similarly, b} 4 4 need not be equal to 
CA, AeA, but from (bw), (ew) and (c) it follows that 


by A. A(14 © WA)WA = CA, AeA(lA € WA)WA. 


From that with (ców) and (ao?) we obtain, however, bri: = €u1e1 (we have said 
already that b; 1 ; is equal to aj € aj). The equation (bw) is obtained by reversing 
the arrows of the first diagram for monoids in [7, VII.3, p. 166, diagram (1)]. 


The arrow 
c7 oc :(AeB)e(C* D) H (A eC) «(Be D) 
for which, with (e), (b) and (c), we can prove 


((f € h) *«(g*3))cA2,B,c,p = €E,Hu,J (f ° 9) © (h o 7)) 


is, like cA, p, a natural isomorphism (the upper index ‘m’ stands for ‘middle’). In 
the equation (bcw8) it enables us, intuitively, to replace contraction of products 
like A e B in wA, gp by contraction of the factors A and B in w4 and wg. In this 
sense, this equation is analogous to Mac Lane's hexagonal equations. We shall call 
(bcw8), and equations derived from it with (bb) and (cc), octagonal equations, be- 
cause the commutative diagram corresponding to 


bz A,BeB(1A * ba, p,p)(1a ° (cn,A e1p))(14 e bg 4.5)D4 p, A.BWAeB = 
| (w4 elpep)(lA e wg) 


has eight sides. There are two main octagonal equations: (bcw8) and 
C4, A,B,BÜWA e WB) = WAcB- 
As (bw) is obtained by reversing the arrows of a diagram for monoids, so 
the equations (okw) and (kw) are obtained from the remaining diagrams for 


monoids in [7, VIL3, p. 166, diagram (2)] by the same procedure (which involves 
replacing o by co* and ô by ó*). These equations can be replaced by the equations: 


Forf: AF B, opg(kae f)wA — f, óp(f eka)wa =f. 


If pap: Ae BF Aand p, 5: Ae BF Aare defined by 


PA,B =af ÓA(la e kp), PA =at 0B(ka e 15) 
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then (okw) becomes PA. AWA = la and (ókw) becomes pA,AWA = 14. It 
follows from (1k) that o4 — Pr. A and ó4 = pa, (cf. Section 4). In the presence 
of c arrows, (ckw) and (dkw) are not independent: one can be derived from the 
other. 


The equation (oô), i.e. c1 = ôr , becomes superfluous in the presence of (k) and 
(1k). Likewise, (ców) follows from (1k) and (ckw). However, these superfluous 
equations may be needed even if we dont have k arrows (see Section 6), and be- 
cause of that we keep them as primitive. In the same spirit, we have assumptions 
for both o and 6 arrows, though these assumptions are not mutually independent in 
the presence of the c arrows, as we have noted above. From the beginning, we also 
have the superfluous equation (e1). 


3 MODAL FUNCTIONAL COMPLETENESS FOR NLO CATEGORIES 


Given an NLU category C and an object A of C, we build the polynomial NLO cat- 
egory C[z] with an ‘indeterminate’ arrow z : I+ A by a procedure described in [6, 
I.5, p. 57]. Namely, we add a new arrow x : I - A to the underlying graph of C 
and then build the NLO category freely generated by the new graph. That this will 
succeed is guaranteed by the fact that NLO categories are equationally presented, 
i.e. that all our assumptions about arrows are equations. We shall not rehearse here 
this procedure, which is explained in sufficient detail in [6]. It is also covered by 
theorems in universal algebra concerning equationally presented algebras with par- 
tial operations [5, Section 5, p. 124, corollary to Lemma 3], and [1, Section 7, p. 
129, Corollary 1 to Proposition 18]. The name 'polynomial category' is explained 
by thinking about the arrows of C [x] as polynomials in x. 


We can now state the central result of this paper: 


MODAL FUNCTIONAL COMPLETENESS THEOREM. 

For every arrow ọ : B + C of the polynomial NLO category C(x] built over the 
NLO category C with x : I+ A, there is a unique arrow f : OA e B F- C ofC such 
that f (3 e 15)o, = ọ holds in C[z]. 


We also have the following: 


COROLLARY 

For every arrow p : I + C of the polynomial NLO category C{x] built over the 
NLO category C with x : I FĀ- A, there is a unique arrow g : DA F- C ofC such that 
gx = q holds in C[z]. 
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If B is I in the Modal Functional Completeness Theorem, then in C[z] we have 


f(z e lijo} = ¢ | 
fé, x = v, with (oô), (6) and (56°). 


So, in the Corollary, we shall take g to be foh 4: Actually, as with cartesian 
closed categories in [6, I.6, p. 61], the Corollary entails the Theorem, since the 
arrows Y : B - C of C[z] are in one-one correspondence with the arrows *(vóp) : 
I+} B >C (or (yog)* :I- C + B). This one-one correspondence also entails 
that our restriction of x to arrows of type I F A is surmountable (cf. [6, I.2, p. 52, 
Exercise 1]). However, the Theorem has a better form than the Corollary for the 
proof we are going to present. The remainder of this section will be devoted to this 
proof of the Modal Functional Completeness Theorem. 


If y ranges over arrows of the polynomial NLO category C[x] built over the 
NLO category C with z : I + A, whereas f ranges over the arrows of type 
OA e BIC ofC, for some objects B and C, and 


fix =at f (a *15)o$ 


then the Modal Functional Completeness Theorem asserts that ‘x is an onto and 
one-one function from the f arrows to the y arrows (from the definition of ‘z it is 
clear that it is a function). Our proof of the Modal Functional Completeness The- 
orem (inspired by the proof of functional completeness for cartesian closed cate- 
gories in [6, I.6]) will proceed by defining a function uz from the q arrows to the f 
arrows and showing that ju, is the inverse of ‘x. Applying Hz to ¢ is related to ap- 
plying the functional abstraction operator Ax. Syntactically, 4x binds the variables 
x that may occur in the polynomial y. Semantically, it extracts from q a function 
that may be applied to x in the sense of ‘: if uz is like functional abstraction, ‘x 
is like application to x. The analogue of zz in [6, L2, I.6] is Krea, whereas the 
analogue of ‘x is (z(Op, 1B). 


We define the function u+ by the following inductive clauses, which cover all 
possible forms an arrow of C [x] may have: 


(u0.1) pox = raAónA 
(u0.2) Forh: D F- E an arrow of C, 


zh =hop(koa e 1p) 
= hpoa,p- 
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(u1) For~: DEF Eandé: EFF, 
Us (£V) = us£(1na * Uz) bo, na, p(Woa e 1p). 

(u2) Ford: Di Eand£: FFG, 

Ux (V e£) = (us V © us&)e A O4p,r(woa * 1 Der). 
(u3.1) Fory : Ee DEFF, 

Ux (V) = "(uz b E p(cg Da *10)bgna,p)- 
(u3.2) Fory : De EFF, 

Us (U^) = (us V bg. p.p) - 
(ud) Forw: Dt Eand£: DEF, 

us (Y, £) = (us, Hag). 


(u5)  Fory: EF D,£: FF Danddg,g, p: Ge(EV F)F (Ge E) V (GeF) 
defined by dG,E,F =df EG (GeE)v(Ger) (le d l'&GeE,GeF;' KGE GeF]): 


Bx v^, €] = [Ls V, Ds e]dna,z,r- 


(u6) Fory: DF E with D modalized, 
us (I9) = (us). 
Let us first deduce that the following equations hold in C: 
(u1.1) Fory : DF Eand h : EF F an arrow of C, 
us (hp) = hust. 
(u1.2) Forh: Dt Eanarrowof C and £: EF F, 


pa (£h) = Ls £(1nA bd h). 
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We could as well have taken (1.1) and (11.2) as clauses in the definition of uz, 
but then we would have to show that they are compatible with (11) and (10.2), and 
this compatibility is demonstrated by deducing them from (11) and (10.2). For 
(41.1) we have 


Us (h))- hoe(koa e1g)(1na e Heb) BG, n4. p(Woa *lp) with (ul) 
and (40.2) 

= hag(lre uzv)(kaa ° load) bG,4 n4, p(Waa4 ° 1p), with (e) 

= husbooaepby na p((kaa e1n4) elp)(wnpa elp), with (c) 

and (b) 

= husv((ea4(ka4 e 1n4)waA) è 1p),with (cb), (bb) and (e). 


Then we apply (okw) and (e1). For (11.2) we have 


us (Eh) us£(1n4 © (hap(kaa *1p))bis na p(wna e 1p), with (ul) 
and (0.2) 

= Is£(1nA e h)Qoa e a p)b54.p((1na e kga) e 1p)(wpnaA e 1p), 

with (e) and (b) 

= ps £(1n4 eh)((óní(1n4 ° koa)woa) e 1p), with (OR ey 

and (e). 


It remains to apply (ókw) and (e1). 
In a rather similar manner, with the help of (10.2), (okw) and (ókw), we derive 
from (u2): 
(u2.1) Forw: DEF, 
pa (E ep) = (Ee pu. )bEnA p(cna,g e Dba, g.p- 
(u2.2) Forv : DE F, 
us (b © E) = (ust $ E)big D,E- 


In these clauses we write De f for 1 pef and fe D for f el p. If the unary operations 
on arrows D e _ and . e D are primitive instead of the binary operation on arrows 
- * ., then we can derive (12) from (12.1) and (u2.2). These substitute clauses, 
which are often simpler to work with than (52), will also serve for the results of 
Sections 5 and 6. 


Then we have to check that uzh is well-defined for arrows h in C. For example, 
we must check for h = ts,s: Dt Eandt: E F- F, that an equation correspond- 
ing to clause (51), namely, 


hptj4.p = tpoa.e (loa . (sPn4 p))bGa n4, p(Woa elp) 
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holds in C. This amounts to the deduction of (41.1). We have to check similar 
equations corresponding to clauses (12)-(16). We shall not go into the details of 
this lengthy, but rather straightforward, exercise. However, let us note as a hint that 
it may be easier to check such equations with D e _ and _e D, rather than with _e _. 
In that case, instead of (12) we use (42.1) and (2.2). Let us note as another hint 
that when we check the equation corresponding to (u5), we use the fact that the 
distribution arrow dg, g, p is an isomorphism (actually, a natural isomorphism), its 
inverse being [lg e &g,r, lG e Kp Fl- 


Next we check that jz, is indeed a function: 


LEMMA 1. [fy = v holds in C[zx], then psp = uz v holds in C. 


Proof. From the inductive definition of jz, it follows immediately that if uzp = 


Lc holds in C, then pz (£p) = pus (£V) and pz (pE) = nu; (V£) hold in C. We have 
analogous implications for the other operations on arrows of NLLI categories. 


If p and w are arrows of C and y = v holds in C[z], then o = v holds in C. So, 
we have in C 


Quazlp = Vuszlp 
He? = pa V, by (41.1) and (cat 1) 


(we could as well have used (11.2)). 


It remains to check that for all the equations y = y we have assumed for NLO 
categories, in which arrows of C[z] not in C may occur, uz = Uz holds in C. 


The equations of (cat 1) are covered by (1.1) and (11.2). For (cat 2) we have 
ps (£p) 


= us£(1na * (us (loa * us e)bGA4 n4, s(wna *18)))bG4 na, s(wna e 1B) 
= pe€(loa e usV)(1aa e (loa * ui«o))(1na * bga n4, s) PO, naAmA,5 
((1n4 * Wo) * 15)(wnA * 15), with (e) and (b); 


then we have the pentagonal equation 
(1n4 e b54.D4,B)PGA,DA«DA,B = bo4,04,0408 PO 4004,04,8(bo4.04,0A4 elg) 
which with the help of (bw) and (b) yields 


Us(£(Vv)) -us£(1na * Heb) bay n4 c(1na ma * us v)(waa * 1na.n) 
b5,4.04,B(WOA e 1g) 


= uz ((Ep)¢), with (e). 
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For (e) it is again easier (and more instructive for what we do in Sections 5 and 
6) to work with D e _ and . e D instead of _e _, i.e. with (42.1) and (2.2) instead 
of (u2), and check (e2) and (e bifunctor). For the first equation of (e2) we need 
pentagonal and hexagonal equations, whereas for the second we need only a pen- 
tagonal equation. For (e bifunctor) we need (cw) besides pentagonal and hexagonal 
equations in order to check that 


co, ,A1,UA A2 PO Ae As, DA,A2 (cna ,DAeA, 9 A»)bpg4 ,D1Ae A1 A2 (0A . ba, A1 Ay) 
boa, DA,A41*A2 (woa @ (41 è A2)) = woa e (Ai © A2). 


For (c) and (ô) we apply (11.1) and (41.2). Among the remaining equations for 
NL categories that need checking, namely, the closure and bicartesian equations, let 
us consider (L). 


For o : L | B, we have to show uztB = Hz. With (L) we have 


*(UzlB) = loasB 
"(ustB) = *(Uz¥) 
€oaoa—eloa e*(uz.ig) = €g4.o0a—sB(loa e "(uzq)) 
UclB = Uz, with (> £). 


A similar trick involving * is applied when we show that the distribution arrow d 
is an isomorphism, and this we need when we check the lemma for (V7). 


It remains to check (C38), (0n), (b), (€), (k) and (w). The cases with (L18) and 
(On) follow readily by applying (51.1) and (16). For the case with (b) it is eas- 
ier and more instructive to break the checking into checking (bı), (b2) and (b3) 
with the clauses (22.1) and (12.2). Then for (bi) we need the pentagonal equa- 
tion (b5) (see the justification of pentagonal equations in Section 5). For (b2) we 
use two pentagonal equations, and for (b3) one hexagonal equation besides three 
pentagonal equations. To check (c1) or (c2) we need a hexagonal equation (see the 
justification of hexagonal equations in Section 5). For (k) we just use the fact that 
the object I is terminal for arrows from modalized objects. Finally, for (w) we have 


Hs((p ° p)wB -(use*use)ctina, p,sWaA * WB), 
with (12) and (21.2) 
= (usq © Uc P)WoOAeB, With an octagonal equation 
= us (woeo), with (w) and (1.1). 


The following two lemmata assert that jz is the inverse of ‘x. As pz corre- 
sponds to functional abstraction and ‘z to application, so Lemma 2 corresponds to 
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B-conversion and Lemma 3 to 7-conversion. 


LEMMA 2. Foro: B - C an arrow of C[z|, the equation (uzq)'x = q holds in 
C[z]. 


Proof. We proceed by induction on the complexity of y. For the basis we have: 


(0.1) If is x, then 


(u.z)zr = rAón4(z9 e 11)oi, by definition 
= rar roi, with (6) | 
= 2, with (C18), (có) and (cc*) 


(0.2) If y is an arrow h of C, then, by definition, 


(Lu, h)'z = hog(ko, e 15)(zP e 15)o5 


and we use (e), kgaz? = 1; and (co?) to show that the right-hand side is equal 
to h. 


In the induction step we shall only consider the following case as an example: 


(1) If p is £y, with y : BF E and £: EF C, then by the induction hypothesis 
we have (u,V)'z = w and (nu, £)'z = E. We need to prove (u,(£U))'z = 
£v), which amounts to ui, £(114 euis )brj4 n4 p(WoAels)(tlelp)o% = 
Bs E(xM e 1g)oi uv (x o 15)05. For the left-hand side (lhs) of this equa- 
tion we have 


lhs  -ps.&(1nAe us V)b5a n4, a (27 ex”) 


eip)(wielnp)o S, 
with (e) and (w) 
= us£(1na4 © ua) (zB e (z? e15))bfi s(wi e 15)o 5, with (b) 
= pa£(zP elg)(lie (usi (zP e15))bins(ciels)cb.  — 
with (e), (ców) and (co*) 
which, with (c) and a triangular equation derived from (cb) with (co*) and 
(bb), is equal to the right- hand side. 


As a lengthy exercise, it remains to check cases corresponding to clauses (u2)- (116) 
(for ease, it may be preferable to work with (12.1) and (112.2) instead of (12); see 
the justification of (cb) and (cób) in Section 5). a 


LEMMA 3. For f : DA e B - C an arrow ofC, the equation u,(f‘x) = f holds 
in C. 
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Proof. By using (1.1), (41.2), (42.2), (u6) and (10.1) we obtain 


He(f'r) = f((raónA)P e1p)bg, g(1na © oh) 
= f(ónA* 1p)bG,1e(o4 e 05), with (On); 


with the triangular equation (db), (e) and (aa*), the right-hand side is equal to 
f. = 


As a corollary of Lemma 2 we can easily: obtain a more general statement; 
namely, for every arrow a : I - A of C[z] the equation (u4,q)'a = pē holds in 
C[z]. In this equation, which corresponds exactly to G-conversion, f‘a is defined 
analogously to f‘z, and yz is obtained from ¢ by uniformly substituting a for z. 
We shall not define this substitution with more precision since we don't need the 
corollary here. Lemma 2 suffices; i.e., Lemmata 1—3 immediately give the Modal 
Functional Completeness Theorem. 


4 FUNCTIONAL COMPLETENESS FOR CARTESIAN, CARTESIAN 
CLOSED AND BICARTESIAN CLOSED CATEGORIES 


An NLLI category in which every object A is isomorphic with DA is a cartesian 
category with respect to e and I, a cartesian closed category with respect to e, > 
and I, and a bicartesian closed category with respect to e, +,I, V and L. From our 
axiomatization of NLO categories in Sections 1 and 2 one can easily extract non- 
standard axiomatizations of these sorts of category by selecting the assumptions 
tied with the mentioned operations and objects (of course, we always assume that 
we have the 14 arrows, composition, (cat 1) and (cat 2); i.e. that we are in a cate- 
gory). 


As a matter of fact, to axiomatize bicartesian closed categories in such a non- 
standard way, all we have to do is forget about O in Section 2, without making any 
selection among our assumptions. Namely, let us assume whatever is assumed in 
Section 1 for NL categories, and let us, moreover, assume the structural arrows b, 
c, k and w without provisos concerning modalized objects. We forget about the r 
arrows and the operation on arrows Ü- the arrowr A may be identified with 14 and 
fË is just f. For the structural arrows without provisos we assume the equations 
of Section 2. 


Remember that we have defined the p and p' arrows at the end of Section 2 by 


PA,B —at ÓA(14A e kg), Pp —at C B(kA *1g). 
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In the same spirit, we define the binary operation on arrows 


f:CFA g:CKB 
Ug: CF Ae B 


by 
159] =ar (f * g) wc. 


Then we can show first that (pA, p; p; g) is a natural isomorphism from A e B 
to A ^ B, its inverse being (74,5, 74 p}. The essential step in checking 


{TA,B, A.B) (DA, B; PA B) = l4,B 
is to show {PA,B; PA Bt = l4ep, i.e. 
(cókw) ((óA(1A e kp)) e (cB(KA e 1p)))waA.B = lAeB. 


In this equation is hidden the octagonal principle: with an octagonal equation, the 
left-hand side is equal to 


(0A e c B)CA 11 B((1A ek,) e (kg elg) (wa owp) 


which with CA LI, B = 1(Acl)o(IeB) (an equation related to cy; = 11.1, mentioned 
in Section 2), (6kw), (okw), (e) and (e1) is equal to 14, p. 


We can also prove equations that correspond exactly to (A8) and (^q): 


(8) PAslfsg)2h Paalfgh=g 
(en) For À : C Ae B, {pa,Bh, p^ gh) =h. 


For (e) we use (cókw). Note that, with (e), the equation (cókw) yields also 


UPAB;gPA,B) - f eg. 


To show all that we don't need >, +, T, V, L and the assumptions tied with them. 
The equation last displayed permits us to give (+ 8) and (+ 7) the form they 
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have in [6, I.3, p. 53, E4a, E4b]. Of course, we can analogously rewrite (> £) 
and ( 1). 


Next we show that 7; is a natural isomorphism from I to T, its inverse being kr. 
For that we need only ( T) and the k equations. 


Finally, we show that we can keep — and reject «—; actually, it does not mat- 
ter which one of the two implications we keep while rejecting the other. This fol- 
lows from the fact that (e7 Bc A—B,A)* is a natural isomorphism from A —> B to 
B + A, its inverse being *(€4 gCA,B— A). To demonstrate that, we need only (e), 
closure equations, (c) and (cc). 


So we have indeed an axiomatization of cartesian closed categories, which, given 
that we also have the assumptions for V and .L, amounts to an axiomatization of bi- 
cartesian closed categories. We can forget in this nonstandard axiomatization about 
+, ^, T and the assumptions tied with them. 


Our proof of the Modal Functional Completeness Theorem then yields an alter- 
native proof of ordinary, nonmodal, functional completeness for bicartesian closed 
categories. We only have to forget about O. That means that in the statement of the 
Functional Completeness Theorem we shall have f : Ae B l- C and f(xelg)o% = 
o, i.e. f'zis f(ze15)o*, (cf. the Substructural Functional Completeness Theorem 
in Section 6). In the definition of jz, in clause (1/0.1) we shall have 


Bst =14ô4 
= ÓA 


whereas in the other clauses we just delete O wherever it occurs (clause (6) is 
omitted). Then it may be checked that our new pz may be identified with Krea of 
[6, I.2, p. 51]. For example, since 


Krea (EY) = KxeAE (TA, D, Keeap) 


the connection with clause (51) in the new definition of Hx is achieved by verifying 


(l4 * uz V)b4 4 p(wae lp) = {Pa,D, is V] 


which is done with the help of an octagonal equation. 


It should be clear that we also have an alternative proof of ordinary functional 
completeness for cartesian and cartesian closed categories. As we have already re- 
marked, from our nonstandard axiomatization of bicartesian closed categories we 
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obtain a nonstandard axiomatization of cartesian closed categories by rejecting the 
assumptions tied with the bicartesian equations. For cartesian categories we reject 
moreover the assumptions tied with the closure equations. In our nonstandard ax- 
iomatization of bicartesian closed categories, the essentially nonstandard part is the 
cartesian part, which is given by the operation e on objects, the object I, the ar- 
rows 14, the có arrows, the structural arrows b, c, k and w without provisos for 
modalized objects, the operations composition and e on arrows, and the equations 
between arrows: (cat 1), (cat 2), the e equations, the có equations and the b, c, k 
and w equations. 


In the same style, we can make as nonstandard the axiomatization of the part in- 
volving the coproduct V and the initial object L in bicartesian categories. 
Namely, we would have special arrows analogous to the od, b and c arrows in which 
eis replaced everywhere by V and I by L. Instead of k and w arrows we would have 
ta: 1+ Aandw\ : AV AF A. We would assume for these arrows equations 
analogous to the e equations, the có equations and the b, c, k and w equations. 
In the equations involving w” and : the order would have to be reversed and the 
analogues of ø and 6 replaced by the analogues of g? and ôt (the equations corre- 
sponding to the k equations would state that L is an initial object; we have written 
down these equations at the very end of Section 1). 


In our nonstandard axiomatization of cartesian categories there are redundancies 
among the special arrows (there are also redundancies among the equations, as we 
have noted at the end of Section 2; however, (e1) is not redundant now: we use 
it to derive (odkw)). Actually, since the definition of p is given in terms of 6 and 
k, the definition of p’ in terms of o and k, and the definition of the curly brackets 
in terms of w, we can keep only the special arrows 1,4, 0,6, k and w, and define 
all the others with p, p' and the curly brackets, as this is done with the standard 
axiomatization of cartesian categories. Namely, we would have 


0*4 —ar {ka, 14}, 6% =ar {1a, ka} 

bz pc 7at {{PA,BeC, DB,CPA, pec); PB, OPA, BoC} 
b4 a.c —df {PA,BPAcB,C; {P'4, BPAcB,C, Preach} 
CA,B —df {P4 5; PA,B}- 


The equations behind these definitions all hold in our nonstandard axiomatization 
of cartesian categories (for the first two we use (ckw) and (dkw), whereas for 
the last three we use octagonal equations). Note that the last definition says that 
in Gentzen's sequent systems we can derive permutation from thinning and con- 
traction, provided we are allowed to contract sequences of formulae, rather than 
single formulae only. Here is a permutation obtained by two thinnings followed by 
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a contraction: 


B,AKC 
A,B,A- C 
A,B,A,BEC 
A,BEC 


To obtain cartesian categories with this reduced stock of special arrows we have 
to assume (cat 1), (cat 2), (e), (c), (6), (k), (1k), (w), (okw), (kw) and (cókw). 
The octagonal equations enter into this axiomatization via (cókw). By the way, 
the c™ arrows have the following symmetrical definition in terms of p, p' and the 
curly brackets: 


CA B,C,D “df . 
I / 
{{PA,BPAeB,CeD; PC,DP'40B,CoD}: [PA,BPAeB,C» D; PC DP AeB,CeD}}- 


We cannot economize similarly on o*,ó*, b and c arrows with the restricted, 
modalized, versions of the arrows k and w in Section 2. With the definitions of 
ot, ô, b and c given above, o% and 5% would be lacking if A is not 
modalized, whereas b? 5 o, b4 p c and c4,p would be lacking if A, B and C 
are not all modalized. For example, w 4,5 hidden in the curly brackets of (p^4 p, 
pa,B} is not available if A and B are not both modalized, whereas in Sections 2 
and 3 we need c A p even in cases where only A or only B is modalized, and simi- 
larly with b arrows (the arrows o% with B not necessarily modalized are involved 
in the formulation of the Modal Functional Completeness Theorem). 


Another axiomatization of cartesian categories may be obtained by taking the 
special arrows p or p' as primitive instead of k. With p primitive, we define k 4 as 
p1,40*, and p'4 p as pp,ACA,p. A further possibility is to define 64 as pA, and 
C A a pA ICI, A. We shall not investigate here what reshuffling of our equations this 
change of primitives would require. Let us only note that in Section 2 we may have 
taken pA,g as primitive instead of k 4 provided B is modalized. Still another pos- 
sibility is to take the curly brackets operation on arrows as primitive instead of the 
w arrows and define w 4 as (14,14). In Section 2, we would have to require that 
with f : C - Aandg : C F B we have ( f, g} only if C is modalized. However, 
as we have explained in the previous paragraph, having p and these restricted curly 
brackets primitive would not permit us to economize on b and c arrows in Section 
2. 
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5 NECESSITY OF ASSUMPTIONS FOR NLO CATEGORIES 


We have seen that what we have assumed for NLO categories is sufficient to de- 
monstrate modal functional completeness. We want now to address the question 
whether these assumptions are also necessary. This is not a question we can an- 
swer in an absolute sense, but only relatively to some presuppositions. These pre- 
suppositions are contained in the particular notion of category without functional 
completeness that we extend to obtain a notion of category with functional com- 
pleteness, as our notion of NL category was extended to the notion of NLO cate- 
gory. But, foremost, the exact form of the functional completeness theorem carries 
presuppositions about what are polynomial arrows q and about the type of the ar- 
row f (the arrow f is of type ODA e B F- C rather than Be ODA F C, or some other 
type). Moreover, we require that a particular function from the f arrows to the y 
arrows be onto and one—one. We shall see in the next section that we may under- 
stand functional completeness in different ways by restricting the notion of poly- 
nomial (still another way to restrict functional completeness is briefly mentioned 
in the concluding section). 


For the time being we assume we have NL categories, axiomatized as in Sec- 
tion 1, and we shall try to see to what extent the additional assumptions for NLO 
categories, in Section 2, are necessary for proving modal functional completeness 
as this is done in Section 3. After that we shall try to see whether at least some 
assumptions about NL categories in Section 1 are necessary in the same sense. 


That in NLO categories we must have the special arrows r, the modalized struc- 
tural arrows b, c, k and w, and the operation on arrows O follows, as explained in 
[3], from the Modal Deduction Theorem and its converse, which are both conse- 
quences of the Modal Functional Completeness Theorem. The Modal Deduction 
Theorem says: 


(O |) For every arrow o : B F- C of the polynomial NLO category C[z] built 
over the NLU category C with z : I + A, there is an arrow f : OAe BFH C 
of C. 


It is clear that to prove (O |) it is enough to take for f the arrow pz. The converse 
of the Modal Deduction Theorem says: 


(Of) For every arrow f : DA e B+ C of the NLO category C, there is an 
arrow y : BF C ofthe polynomial NLO category C[z] built over C with 
z:IF A. 


It is clear that to prove (O 1) it is enough to take for the arrow f‘z. It is also 
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clear that though we may infer (O |) and (O f) from the Modal Functional Com- 
pleteness Theorem, this theorem does not follow from (O |) and (O f) alone. In 
(O |) and (O 1) nothing is said about jz, and ‘zx, and their functional character. 


Although this is something proven in [2, 3], we shall show again in more detail, 
and in a way adapted to the present context, how (O |) and (O f) deliver the special 
modal arrows and the operation on arrows - of NLO categories. For that it will be 
useful to have the abbreviations: 


Forg: CF A2 B, ?g —at £4 p(la eg). 
Forg: CF A+ B, g? —ar €4 (la * 9). 


r) SinceinC[z] we have z : It A, by (O |) we must have f :OAeIt Ain 
C, and we take fd}, to be ra. 


b) Since in C we have 
lgasnyc! : DAe B H ((DAe B) eC) HC 
by (O 1) we must have in C[x], where x : It A, 
vc: BF (Ae B) e C) — C 
and hence also qo? : Be Ct (LIA e B) eC. Then, by (D |), we must have in C 
f: GBAe(BeC)F (DAeB)eC 


and we take this f to be b5, 5 c. 


Since in C we have 
"(L4 npyec" Ong: OB e I- A 2 (((AeOB) eC) — C) 
by (O 1) we must have in C[z], where z : IF B, 
qe :IF AA ((AeOB)eC) — C) 


and hence also *((*q6*)?) : C - A + ((AeOB) eC). Then, by (O |), we must 
have in C 


f: BBeCF- A— ((AeOB) eC) 
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and we take ° f to be bas c. 


Since in C we have 
*luOByeacónc : OC e IF- (Ae B) > ((Ae B) e OC) 
by (O f) we must have in C[z], where x : IF C, 
e :IF(AeB) > ((Ae B) eC) 


and hence also *(*(*(5*.,5)ópg) : I- B 2 (A — ((A è B) eOC)). Then, by 
(O |), we must have in C 


f:BCeI- B— (A ((AeB)eGQC)) 


and we take ??( f Shc) to be b2’ g mc- We proceed analogously for the b“ arrows. 


c) Since in C we have 
'1p.nasna : DA e IF B 2 (BeDA) 
by (D 1) we must have in C[z], where x : I A, 
p:IF B> (BeDA) 


and hence also (°yôhop)* : 1+ (B è DA) + B. Then, by (O |), we must have 
in C 


f: BAeIF(BeLDA) — B 


and we take (f ôH a): to bec, og. We proceed analogously for c4 oB. 


k) Since in C[z] we have 1;, by (O |) in C we must have f : OA eI F I. We 
take fd}, to be kga. 


w) Since inC we have 6q,, by (O f) we must have y : IF DA in C[z], where 
z:It A. Then in C[z] we have (y e y)of : It (DA e DA), and by (D 4), inc 
we must have f : DA eI H- (DA e DA). We take fó&, to be wga. 


0) Since in C we have dqa, by (O 1) we must have y : I- OA inC [x], where 
z:IF A. We take this y to be zH. So we have H applying to arrows of type I F- A. 
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We want to show that we have also Y applying to arrows of type B F A where B 
is any modalized object. 


Since in the polynomial NLO category C[x][y] built with y : I F B over the 
polynomial NLO category C[x], which was itself built over the NLO category C 
with x : IF A, we have 


((zey)oi)H : IF O(A e B) 

by (O 1) we must have g : OB eI - O(A e B) in C[x], and hence also 
goon : OB F O(A e B). 

Then, again by (O |), we must have in C 
f:OAeOBEO(Ae B). 


We also have 1! in C. 


Suppose that we have in C an arrow h : OC FĀ A, and hence also hégg. Then 
by (O f) we must have o : I - A in C[z], where z : I - C. Hence we have yU 
in C[z], and by (O {) we must have f : OC eI +t OA and fó&o : OC H OA 
in C. Taking fo5c to be AY, we have H applying to arrows of type OC + A. To 
have full Hl, applying to arrows of type B + A where B is any modalized object, it 
remains to show that for modalized B we have in C an arrow of type B F OB, and 
this we do by induction on the complexity of B, using the arrows and operations 
on arrows we have already secured. 


The fact that for modalized A we have an arrow of type A + DA, as well as rA, 
gives us the structural arrows b, c, k and w with the provisos involving any modal- 
ized objects, as we have assumed them in Section 2, and not only boxed objects, as 
we have inferred them above. 


Now we shall justify to a certain extent the equations of Section 2. In this justifi- 
cation we presuppose the equations (b), (bb), (c), (cc), (k) and (w); i.e., we presup- 
pose that the b and c arrows are natural isomorphisms and that the k and w arrows 
are natural transformations. We also presuppose the equation (2), which is of the 
same sort as (b), (c), (k) and (w). Proof- theoretically, it amounts to permuting the 
rule with cut. We presuppose that modalized objects A are isomorphic with ODA; 
i.e., for modalized A we presuppose the equations 


ralgd=iy,  19r4 = 10. 
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These equations are of the same sort as (bb) and (cc). They amount to the conver- 
sion of some detours in proofs. Furthermore, we presuppose that the definitions of 
"p and us are given and that pz satisfies also clauses (41.1), (1.2), 
(2.1) and (u2.2). Finally, we presuppose Lemmata 1-3, i.e. that 14, is a function 
and that it is the inverse of ‘x. Our derivation of the remaining equations of Section 
2 from modal functional completeness will depend on all these presuppositions. 


In some cases our derivation will fall short of obtaining the required equation in 
full generality in which it holds. This will happen with the pentagonal equations, 
the hexagonal equations, the triangular equation (cÓb) and the equation (cóc). For 
them we shall push our derivation only up to instances of these equations where 
particular indices are modalized. We shall indicate at what places other instances 
are required, if such is the case. 


First we derive (C18). As a consequence of Lemma 2 (see case (0.1) in the proof) 
we have 


(Opr) rar} =r. 
Then for f : B - A with B modalized and y : I - B we have 


rA fy =ra(fy), with (P) 

fAf y = fy, with (O62) 

uy (rA f" y) = ny(fy), by Lemma 1 
rAf"rpónp = friónp, with (u1.1) and (u0.1) 


which | with (65) andrg19 = 1g yields (C18). The equation (On) amounts to (H) 
and 19 ara = loa, which we have presupposed. 


Next we justify the pentagonal equations. From Lemma 1 and from the equation 
(b), more precisely (b1), it follows that we must have 


Bs (((9 € 1c) e1p)bg c.p) = us(bgc,p(V e (1c e1p))) 


which by using (11.1), (1.2), (u2.2) and (b) gives 


((Lz e lc)e 1p)(bg, p.c * 1p)br4 Bec,p(loa * Pgc,p) = 
((uzpelco)e Lp) ba asp, C, pb, B,CeD' 


Then we substitute 1p A, p x fory and, by Lemma 3, obtain the pentagonal equa- 
tion (b5) with A boxed. Since a modalized object A is isomorphic with DA, we 
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have the pentagonal equation (b5) for modalized A, too. This is not the full pen- 
tagonal equation (b5), because, in the full one, A need not be modalized provided 
two objects among B,C and D are modalized. Other forms of pentagonal equa- 
tions, not covered by (b5) with A modalized, are involved in the proof of Lemma 
1 when we derive the equations obtained by prefixing 14, to the two sides of (b2) 
and (b3), instead of (b; ) as above. 


At this place we can make the following remark. By going carefully over our 
proof of the Modal Functional Completeness Theorem, one finds that we never 
need the arrows b g c and b] 5 œ where C is modalized, but only where A or 
B is modalized, except when in the proof of Lemma 1 we derive the equation ob- 
tained by prefixing jz, to the two sides of (b3). This has to do with the fact that 
[icy is taken to be of type DAe BE C, rather than B e DA H- C. We might as well 
have defined an analogous uzy of this other type. Then for the first index of the b 
arrows and the equation (bj) we would have the same thing that we have now for 
the third index and (b3). However, with our definition of 1j, , we could not exclude 
the b arrows in which only the object in the third index is modalized because these 
are definable as follows in terms of ¢ arrows and the remaining b arrows: 


bz p.c ^at CC,AeBbo, A, B(CA,c * 15)b 4. c n(14 * CBC) 


where only C is modalized. The equation installed by this definition is a hexagonal 
equation. 


Next we justify the hexagonal equations. From Lemma 1 and the equation (c), 
more precisely (c1), it follows that we must have 


Ax ((V € 1c)co,B) = us(co,p(1lo e v)) 


which by using (1.1), (441.2), (2.1), (42.2) and (c) gives 


(us V @le)bG4 B o(l04 *cc,8) = (HY lo)ecosesbog, g(cua,c * 
15g)bn4oc,p. 


Then we substitute 1534, p'x for Y and, by Lemma 3, obtain a hexagonal equa- 
tion. The same hexagonal equation is induced by prefixing Hz to the two sides of 
(c2). As before, OA can be replaced by modalized A. This doesn't yet amount to 
the full hexagonal equation (bc6), because, there, A need not be modalized if C 
is (it is not enough that only B be modalized). A hexagonal equation where only 
C is modalized is involved in the proof of Lemma 1 when we derive the equation 
obtained by prefixing ju, to the two sides of (b3). Such an equation is also installed 
by the definition of b p ç where only C is modalized, which we gave in the pre- 
ceding paragraph. 
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To derive the octagonal equations it is enough to consider the case with (w) in 
the proof of Lemma 1 and proceed as for the pentagonal and hexagonal equations. 
The octagonal equations are completely justified by that (unlike the pentagonal and 
hexagonal equations, whose justification we have pushed only up to a point). 


Let us now justify the triangular equations and (cóc). As a consequence of 
Lemma 2 we have that 


Bs (V € lg)'z = (ury'r) elg 


which, by substituting 1514, p'x for y, and by using (42.2) and Lemma 3, reduces 
to 


br, pelt" elp.z)gob.g = (22 elp)sp)elg | 
((zH e 1p)e lg)brp.EO Dez = ((24 e 1p) e1g)(oi) elg), with (b) and (e). 


By prefixing pz, to the two sides of the last equation, as Lemma 1 allows, with 
(u1.2), (u2.2), (46), (0.1) and (On) we obtain - 


(((ón e 15)bgj4 ;, p) * 1e)bgA e p,e(1na * (bip go Der) = 
(((»hhuwa e 1p)br, 1 p) ° 1e) bg ten e(loa ° (o * 1g)). 


With (66*), (bb) and (e) this reduces to. 
loa * (bi'p,ga peg) = loa * (op elg) 

from which, by taking A to be I and by using (e) and rr1py1U = 1), we obtain 
l1 è (bp,go peg) = l1 ° (ob elz). 


By prefixing o(1.p)ex to both sides, with (c) and (og?) we derive the triangular 
equation 


(c*b) bry, E% Dek =aopelg 


which, with (oo?) and (e), amounts to (cb). (The triangular equation (ôb), in 
which, however, A is modalized, is derivable by prefixing jz, to the two sides of 


(5).) 
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As a consequence of Lemma 2 we also have 


Bz(lg e V)'z = lge(nuzvV'z) 


which, by substituting 1r pz for y, and by using (42.1) and Lemma 3, reduces 
to 
b? o1p(co,e e 1p)bĝi £ p (2 e lied) E. D —]lge ((zP e 1p)o5) | 
(le e (z9 e 1p))bEp(cuk e 1D)bI E, DI E.D = (le e (a e 1p)) (1g e on), 
with (b), (c), and (e). 


By prefixing Hx to both sides and proceeding as in the last paragraph, with (c*b) 
and (e) we obtain 

bey p((c1,e0%) e1p) = 1g eof 
which, with (cc*), (bb), (cc) and (e), amounts to 

(ccob) ((cECE) e Ip)bg; p = lE ° øp. 


This would be the triangular equation (cób) if we had ogcg, = dp, i.e. (cóc). 


Now, let us see what we can do for (cóc). From Lemma 3 (see the proof of the 
lemma) we can derive 


(Óóng bd lp)bügip = lge *op 
óng * lp = (ecpecag,) * 1p, with (acob) and (bb). 


By taking D to be I and prefixing dp to both sides, with (ô) and (060*) we obtain 
ÔDE = 9DECUOE,- 


So, we have (cóc) for modalized A. 


In our proof of modal functional completeness, we actually dont need (cób) and 
(cóc) except when A is modalized, provided we have assumed other triangular and 
pseudotriangular equations, like (cb) and (oco b). However, such a reduced stock 
of triangular and pseudotriangular equations works only because we have taken 
Liz to be of type DA e Bt C, rather than B e DA FĀ- C. With this other type we 
would need other equations. (With the first type, c is preponderant over 6; with the 
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second, it is the other way round.) Our tidier, more symmetric, axiomatization of 
NLO categories, in which (cób) and (cóc) hold for nonmodalized A too, is insen- 
sitive to this change of type for zz. It works equally well with either type. This 
slight generalization has repercussions on the underlying nonmodal part of NLO 
categories, i.e. the NL part, because o and ô, which were not linked in it, are now 
linked through c. Formerly independent assumptions for ø and 6 can now be de- 
rived from each other, as we noted in Section 2. 


To derive (1k), we have as a consequence of Lemma 2 (see case (0.2) in the 
proof) 


ligi (km e 11) (3D e 1)od = 1i 


and then we use (oô), (6), (co) and (k). 


It remains to derive (ców), (bw), (cw), (skw) and (kw). We derive (ców) 
from the case we have considered in the induction step of the proof of Lemma 2 
(case (1)), together with equations we already have, in particular (cb), which we 
have derived in full generality. We derive (bw) by prefixing jz, to the two sides 
of (cat 2) and using equations we already have, in particular a pentagonal equation 
with A modalized (see the proof of Lemma 1). Similarly, we derive (cw) by pre- 
fixing Hz to the two sides of (ebifunctor) and using equations we already have, in 
particular pentagonal and hexagonal equations with A modalized. Finally, to de- 
rive (akw), take y to be 1rjA,1'2 and h to be 154,1; in the last equation displayed 
in our derivation of (j1.1) in Section 3, so as to obtain 


(oo4(Koa e 1nA)wmA) e 11 = 1p, e 1r. 


Then we prefix ôn a to the two sides of this equation; with (ô) and (66*) this gives 
(akw). (We could as well have taken w to be dg 4‘x). For (kw) we proceed anal- 
ogously using the last equation displayed in the derivation of (41.2). Note that we 
already have all the equations used in the derivations of (u1.1) and (11.2) up to 
the last displayed equations; in particular, we have (cb) and (cób) with A modal- 
ized. (We use (ckw) and (ókw) also in the derivation of (12.1) and (42.2) from 
(u2).) With these derivations, (1k), (ców), (bw), (cw), (ckw) and (ókw) are 
completely justified, and we have accomplished our partial justification of the as- 
sumptions made for NLO categories in Section 2. 


Can we justify in a similar manner the assumptions of Section 1? The exist- 
ence of an operation e on objects and arrows follows from the formulation of the 
Modal Functional Completeness Theorem. That there is a functor behind these op- 
erations is something we have to presuppose, as we presupposed about the struc- 
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tural arrows that they are natural isomorphisms or natural transformations (proof- 
theoretically, the equation (e) amounts to permuting the rule e with cut). The as- 
sumptions concerning e can be understood as structural assumptions, too. Such are 
also the assumptions about the arrows 14 and composition. (The equations (cat 1) 
imply that the arrows 14 are natural isomorphisms from the identity functor to the 
identity functor; proof-theoretically, these equations amount to eliminating some 
cuts, whereas (cat 2) amounts to permuting cut with itself.) 


Since they involve the modalized object I, and are related to structural rules, the 
có arrows might be understood as modal structural arrows, on a par with b, c, k and 
w arrows. So, perhaps the assumptions concerning có arrows could be shifted to 
Section 2, and would need to be justified as much as the modal assumptions of that 
section. (We put, however, có arrows in Section 1 because they don't involve O, 
and because I makes sense in the absence of O, too.) The arrows øg? are needed for 
the definition of the function ‘z (they are in the formulation of the Modal Functional 
Completeness Theorem), whereas c and 6 arrows are needed for the definition of 
Hz: the first for (440.2) and the second for (10.1). The ó* arrows are not absolutely 
needed (as we have remarked above, o is preponderant over 0), but this is only be- 
cause we take 4, to be of type 0A e B | C, rather than B e DA H C. The 
equations (c) and (6) are comparable to (b), (c), (k) and (w), whereas (cc*) and 
(66°) are comparable to (bb) and (cc). They have the same proof-theoretical im- 
port. We presuppose these có equations as we presupposed the corresponding b, 
c, k and w equations earlier in this section. 


We can, however, derive (od) from modal functional completeness. As a con- 
sequence of Lemma 2 (see case (0.1) in the proof) we have 


rióar(zH e li)oi = x 

riz -ójoj = x, with (ó) 

be (riz 010i) = uz, by Lemma 1 

ri(rpdor)~ (1o è (ó10j)) = riónr, with (91.1), (01.2), (u6) and (40.1) 
ócy (1a 9 (6107) = Sor, with 1Pry = 1o and (7) 


which, with (56°), (e), rp1p714 = 11, (c) and (cc?) yields (oô). (Note that we 
.have used (a) in the justification of (03); here we dont use (03), but only the 
isomorphism between I and ODI.) 


The assumptions concerning —, €, ^, T, V and L are obviously independent 
from modal functional completeness. Note, however, that we wouldn't have func- 
tional completeness with V if we didnt have —, or without a primitive distribution 
arrow d (with uzy of type Be DA F C instead of OA e B - C, we would need +, 
or a distribution arrow of type (E V F) eG - (E e G) v (F eG)). A related fact 
(about which we heard from Djordje Cubric) is that a bicartesian category is func- 
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tionally complete in the ordinary sense if and only if it is distributive; in that case 
^, usually written x, plays the role of e for formulating functional completeness. 


To conclude this section, let us note that for a weakened form of modal func- 
tional completeness where we would be happy with asserting that ‘x is onto, with- 
out necessarily being one—one, we would need far less assumptions for our cate- 
gories. In the justification above we could appeal only to Lemma 2, and for proving 
this lemma we dont need 19r A = 1ga, the pentagonal, hexagonal and octagonal 
equations, (bw), (cw), (ckw) and (dkw). 


6 SUBSTRUCTURAL FUNCTIONAL COMPLETENESS 


Let us now take NL deductive systems with the unary operations on arrows De_and 
-e D primitive instead of the binary operation on arrows .e., and let us consider the 
following hierarchy of nonmodal deductive systems, obtained by assuming, in ad- 
dition to what we have for NL deductive systems, the structural arrows mentioned 
in parentheses without provisos concerning modalized objects: 


AL deductive systems (b arrows) 

M deductive systems (b and c arrows) 
BCK deductive systems (b, c and k arrows) 
R deductive systems (b, c and w arrows) 

H deductive systems (b, c, k and w arrows). 


For easier comparison, we use the labels introduced in [3]. The label ‘AL’ stands 
for ‘associative Lambek’, ‘M’ for ‘multiset’ (in the light of latter-day developments, 
it would be more intelligible if we called these systems linear—more precisely, in- 
tuitionistic linear), ‘R’ for ‘relevant’ (intuitionistic and without distribution of A 
over V) and ‘H’ for ‘Heyting’. The label ‘BCK’ is pretty standard (the linear logic 
trade has recently produced some ersatz names for BCK systems; the BCK systems 
we consider here are of an intuitionistic sort). 


Let us write S for NL, AL, M, BCK, R or H (the variable ‘S’ stands for ‘sub- 
structural’) and let S categories be S deductive systems that are NL categories and 
satisfy moreover the equations between arrows of Section 2, which, of course, ap- 
ply only if the arrows in question are present in the deductive system. So, AL cat- 
egories must satisfy b equations, M categories b and c equations, BCK categories 
b, c and k equations, R categories b, c and w equations except (ckw) and (ékw), 
and H categories all the b, c, k and w equations. 
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For all S categories except NL and AL categories, we have that A — B is nat- 
urally isomorphic to B + A, as in Section 4. However, for none except H cate- 
gories, which are the bicartesian closed categories of Section 4, we need have that 
Ae B is isomorphic to A ^ B. Only for BCK and H categories I must be naturally 
isomorphic to T. The AL categories are monoidal biclosed with respect to e, I, + 
and +, and M categories are symmetric monoidal closed with respect to e, I and 
—. However, they are all also bicartesian with respect to ^ and V, their terminal 
object being T and their initial object L. 


We can extend the axiomatization of S categories to the axiomatization of the 
corresponding SO categories as we extended the axiomatization of NL categories 
to the axiomatization of NLO categories: we just add the missing modal assump- 
tions from Section 2. For example, the axiomatization of M categories is extended 
to the axiomatization of MO categories by adding the operation on objects O, the 
r arrows, the operation on arrows and the modal structural arrows k4 and wa, 
with the proviso for modalized A, together with the O, k and w equations (the b 
and c equations are already assumed for the unrestricted structural arrows of M 
categories). The MU categories correspond to intuitionistic modal linear propo- 
sitional logic. For them we can prove modal functional completeness as we did 
for NLO categories, and similarly: with other SO categories. For HO categories 
we have to add just the r arrows, the operation on arrows Ë and the O equations, 
which gives categories corresponding to intuitionistic S4 propositional logic. For 
these categories we can prove modal functional completeness, but ordinary, non- 
modal, functional completeness fails, though we have it for H categories, as shown 
in Section 4 (otherwise we would always have arrows of type A - DA in HO cat- 
egories). We can also infer the necessity of assumptions for our modal categories 
as we did in Section 5. 


We want now to state a general functional completeness theorem, which as a spe- 
cial case covers ordinary functional completeness for H categories, i.e. bicartesian 
closed categories, and in other cases yields restricted functional completeness for 
other S categories. To state this theorem we need special notions of polynomials in 
polynomial categories, which we proceed to define. 


Given an S category C, we build the polynomial S category C[z] with x : It A 
as before. This we can always do because, for every S, the S categories are equa- 
tionally presented. If all arrows of C [x] are called polynomials, then our notion of 
polynomial satisfies the following clauses (parallel with the inductive clauses for 
Liz in Section 3): 


(P0.1) The arrow z is a polynomial. 


(P0.2) Every arrow of C is a polynomial. 
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(Pl) Ifw: Dt Band€é: Et F are polynomials, then £y is a polynomial. 


(P1.1) If 5 : Dt E isa polynomial and h : E | F is an arrow of C, then hy is 
a polynomial. 


(P1.2) I£ h : Dt E isan arrow of C and £ : E+ F isa polynomial, then £h is a 
polynomial. 


(P2.1) If y is a polynomial, then E e v is a polynomial. 

(P2.2) If y is a polynomial, then v e E is a polynomial. 

(P3.1) Ify: Ee Dt F isa polynomial, then *% is a polynomial. 

(P3.2) If y : De EFt F isa polynomial, then V* is a polynomial. 

(P4)  Ifyp: Dt Eand£: Dt F are polynomials, then (v, £) is a polynomial. 
(P5)  Ify: EF- Dandé: F F D are polynomials, then [v, £] is a polynomial. 


Clauses (P1.1) and (P1.2) are redundant in the presence of (P0.2) and (P1), but 
we have listed them nevertheless because we need them for the inductive definitions 
of restricted notions of polynomials. These notion are obtained by assuming the 
clauses mentioned in parentheses: 


M polynomial (all clauses save (P0.2) and (P1)) 
BCK polynomial (all clauses save (P1)) 

R polynomial (all clauses save (PO.2)) 

H polynomial (all clauses) 


(NL and AL polynomials will be considered below). Of course, H polynomials are 
not restricted: they coincide with all the arrows of C[z]. (As we have just noted 
above, we may omit clauses (P1.1) and (P1.2) from their definition.) However, the 
other notions of polynomial reject some arrows of C[z]. 


Before looking into that, let us make a point concerning the nature of the arrow 
x in C[x]. This arrow must be new to C (otherwise, with NLO categories we would 
need the equation zP'kr4 = lga to make (10.1) and (50.2) match; with H cat- 
egories we would need zk4 = 14, which doesn't hold necessarily in cartesian 
categories). However, nothing prevents us from introducing a new arrow xz of type 
I F I, which in the course of constructing C[x] will be identified with 1; for some 
categories (in NLU categories we have 1Hkor = 1p) because kpi = ry; in BCK 
and H categories, and NLO categories as well, we have ky = 1r). So, in some S 
categories, an arrow of C [x] may qualify as a polynomial on more than one ground: 
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in BCK, and hence also H, categories x : IF I will be a polynomial both by (PO.1) 
and (P0.2). 


It can easily be checked that if we exclude the operations on arrows mentioned 
in clauses (P4) and (P5), an M polynomial is an arrow of C[] in whose construction 
x occurs exactly once, a BCK polynomial an arrow of C[x] in whose construction x 
occurs at most once, and an R polynomial an arrow of C[r] in whose construction 
x occurs at least once. 


Note that for R and H polynomials we obtain as a derived clause: 
(P2) If v and € are polynomials, then y e € is a polynomial. 


This is because w e € is equal by definition to (7 e E)(F e£) in all S categories (we 
have taken the unary operations on arrows D e _ and . e D as primitive), and the 
arrow (y e E)(F e£) is an R polynomial by (P2.1), (P2.2) and (P1). Since we lack 
(P1) for M and BCK polynomials, we shall also lack (P2). This is parallel with the 
fact that if we replace clause (u2) by (u2.1) and (12.2), the w arrows enter into the 
definition of uy only via clause (u1). The k arrows enter into this definition only 
via clause (10.2). 


Let now the function 'z be defined as it was defined in Section 4: 
Forz:I- Aandf: Ae BHC,  f'z =a f(zelp)os. 
For pz as in Section 4, we replace clause (0.1) by 
Hst = A 


and assume (11.1), (u1.2), (u2.1), (42.2), (u3.1), (3.2), (u4) and (u5) with O 
deleted everywhere. Clause (0.2) with O deleted will be assumed only in the pres- 
ence of k arrows, which in the present context means only for S being BCK or H. 
Similarly, clause (41) with O deleted will be assumed only in the presence of b 
and w arrows, which in the present context means only for S being R or H. When 
we refer to the 44 clauses from now on, we assume these are the newly introduced, 
nonmodal, clauses. If S is R, clause (1/1) is independent from (j:1.1) and (1.2), 
because arrows of C dont qualify as polynomials. If S is H, we have to show this 
clause is compatible with (11.1), (41.2) and (40.2); this we do by deriving (11.1) 
and (11.2) from (u1) and (10.2), quite analogously to what we did in Section 3. 
Since D e _ and . e D are primitive, instead of -è _, we shall have clause (12) with 
O deleted only as a derived clause when b, c and w arrows are present, and, again, 
this will happen only for S being R or H. That (u2) can actually replace (12.1) 
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and (12.2) will be the case only for H (we use (0.2), (skw) and (ókw) to derive 
(u2.1) and (u2.2) from (2)). 


We can now state our general functional completeness theorem: 


Substructural Functional Completeness Theorem If S is M, BCK, R or H, then 
for every S polynomial o : B | C of the polynomial S category C [x] built over the 
S category C with x : I F A, there is a unique arrow f : Ae B F- C of C such that 
f(x *1p)o*, = v holds in C[z]. 


In other words, the function ‘x is an onto and one—one function from the arrows 
f: Ae BE C ofC tothe S polynomials of C[z]. 


We can prove this theorem by a straightforward adaptation of the argument in 
Section 3. For example, for M categories, we have to check that for an M polyno- 
mial q the k and w arrows are not involved in 4,9, and that Lemmata 1—3 can be 
demonstrated for M polynomials y and y, and M categories C. For other S cate- 
gories covered by the theorem, we similarly have to check that for an S polynomial 
y the structural arrows rejected in S categories are not involved in 1; «, and that 
the proofs of Lemmata 1—3 work. 


In analogy with the demonstration of necessity of Section 5, we can show that 
structural arrows and equations we have assumed for them are necessary. This is 
now simpler than in Section 5, since the complications involving modalized objects 
are eschewed. In particular, we can completely justify pentagonal, hexagonal and 
triangular equations. 


In [3], our restrictions concerning polynomials are matched by restrictions con- 
cerning structural rules in the deductive metalogic, and it is demonstrated that if the 
metalogic is appropriately restricted, BCK logic is minimal in the presence of A for 
proving the deduction theorem and its converse. The discrepancy between that and 
the Substructural Functional Completeness Theorem, which covers also M and R, 
is explained by the fact that [3] always allows the polynomials (7, h) and (h, v) 
where ~ is a polynomial of C[x] and h an arrow of C. Here, we allow that for BCK 
and H polynomials, but not for M and R polynomials. 


The Substructural Functional Completeness Theorem does not cover NL and AL 
categories. The corresponding notions of polynomial should presumably be 


NL polynomial (all clauses save (P0.2), (P1), (P2.1), (P2.2), (P3.1) 
and (P3.2)) 
AL polynomial (all clauses save (P0.2), (P1), (P2.1) and (P3.1)) 
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because, without (2), the b arrows enter into the definition of p, via clauses 
(u1), (42.1), (u2.2), (43.1) and (13.2), and the c arrows via clauses (12.1) and 
(u3.1). If for o : B H C, the arrow usy were redefined so as to be of type 
BeAt C instead of Ae Bt C, then for AL polynomials we would reject (P2.2) 
and (P3.2) instead of (P2.1) and (P3.1). With NL and AL polynomials we shall run 
into trouble in the proof of Lemma 1, because, for example, the NL and AL poly- 
nomial xo] is equal to c A(I e x), by (c), and u; (c A(I e x)) is, by (w1.1), (u2.1) 
and (0.1), equal to 


OA (I e da)br A 1(CAí e Dba 11 


which involves both b and c arrows. The problem is that for S being NL or AL, 
the class of S polynomials considered as arrows in an S deductive system is not 
closed under equality of arrows in S categories (since we lack (P2.1), the arrow 
€ A(Iez) is neither an NL nor an AL polynomial), whereas for S being M, BCK, R 
or H, this is the case. The proof of Lemma 3, too, does not work for NL categories, 
because of an essential use of b arrows: f'x is not an NL polynomial. However, 
the proof of Lemma 2 works, and we can demonstrate a weakened version of the 
Substructural Functional Completeness Theorem with NL or AL substituted for S 
if we dont require that f be unique; i.e. ‘x is onto, but not necessarily one—one. 


7 CONCLUSION 


We conclude this paper by brief indications about matters related to our results that 
we intend to treat in the future. 


There is another way of restricting functional completeness, different from the 
way of the Substructural Functional Completeness Theorem. We may redefine p, 
so that for an arbitrary polynomial o : B - C (i.e. an H polynomial), the arrow uzy 
is not necessarily of type Ae B + C (nor Be At C), but of some type B[A] - C, 
where B[A] is obtained from B by replacing factors D of B by A e D or D e A, 
there are as many A's in B[A] as there are z's in y, and these A's are distributed in 
B[A] in a way matching the distribution of z's in y. This distribution requirement 
becomes unnecessary in the presence of b and c arrows, whereas there can be more 
A's in B[A] than z's in ¢ in the presence of k arrows, and less in the presence 
of w arrows. With such a jz, we may also be able to prove restricted functional 
completeness for various S categories. 


Next, as functional completeness for cartesian closed categories enables us to 
extract systems of typed lambda terms as the internal languages of these categories, 
so modal functional completeness should lead to a kind of system of lambda terms 
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with modalized types, such a system being the internal language of an SO category. 
On the other hand, the restricted functional completeness of Section 6 leads to sys- 
tems of typed lambda terms with restricted functional abstraction (for example, we 
may bind with a lambda operator exactly one variable, or not more than one, or at 
least one). 


At the end of [3] it is supposed that freely extending a nonmodal S category to 
an SO category might result in the former being a full subcategory of the latter. We 
suppose this could be demonstrated by a normalization technique, perhaps inspired 
by Gentzen's methods, or lambda conversion. 


Another matter we will try to consider is the relationship between functional 
completeness and coherence. It is remarkable that assumptions about categories 
that Mac Lane needed to prove coherence for monoidal and symmetric monoidal 
categories should reappear as necessary for proving functional completeness. We 
conjecture that some sort of equivalence between coherence and functional com- 
pleteness could be established. The reason for this equivalence should be that in 
functional completeness we transform a polynomial p : B + C into pry : 
A e B l- C irrespectively of where x occurs in y. This requires that certain di- 
agrams whose nodes are obtained from B by replacing factors D of B by Ae D of 
D e A should commute. And the commuting of these diagrams should be sufficient 
for functional completeness. 
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NOTE ADDED IN PRINT 


The proof of Lemma 1 is incomplete as it stands, because it is not necessarily the 
case, as claimed in the second paragraph of that proof, that if y and w are arrows 
of C and y = v holds in C(x), then y = v holds in C. When this is not the case, 
the proof should be phrased as the corresponding part of the proof of Proposition 
6.1 in [6, chapter I.6]. The calculations of the proof of Lemma 1 are sufficient for 
this rephrasing. 


In the comments about the bijection ‘x one should bear in mind that if ODA e _ 
is not one-one on objects, the bijection is only local; i.e., it exists only between the 
arrows f : ODA e B F C and the polynomials y : B + C for B given in advance 
(and the same with A e . instead of DA e .). 


Kosta Došen 
Université Paul Sabatier, France, and 
Mathematical Institute of Belgrade, Yugoslavia 


Zoran Petrić 
University of Belgrade, Yugoslavia 
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A COMPUTATIONAL INTERPRETATION OF 
MODAL PROOFS 


1 INTRODUCTION 


Proof theory of modal logics, though largely studied since the fifties, has always 
been a delicate subject, the main reason being the apparent impossibility to obtain 
elegant, natural systems for intensional operators (with the excellent exception of 
intuitionistic logic). For example Segerberg, not earlier than 1984 [5], observed that 
the Gentzen format, which works so well for truth functional and intuitionistic op- 
erators, cannot be a priori expected to remain valid for modal logics; carrying to 
the limit this observation one could even assert that ‘logics with no good proof the- 
ory are unnatural’. In such a way we should mark as ‘unnatural’ all modal logics 
(with great delight of a large number of logicians!). 

One of the main drawbacks of modal proof theory is well exemplified, in the 
context of the logics K, KT, and KD, by Scott’s rule: 


[ko 
Ort Qo 


This rule, though allowing the proof of a cut-elimination theorem, is neither a left 
nor a right rule, thus destroying the deep symmetries of the sequent calculus. The 
situation is even worse with the natural deduction formulation of the same logics, 
where either there are no explicit modal introduction and elimination rules, as in 
[5], or the modal rule is formulated as 


Ir 


which is neither an introduction nor an elimination rule and in which the same for- 
mula(s) appear in the premise twice, once boxed and once unboxed. 
The situation is somehow better for S4, with reasonable sequent rules, 


l,r-ro Orta 
— OF —— ——— 
r,OrF o or F- Oo 
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but, once again, problematic natural deduction. The sequent rules appear strictly re- 
lated to those for the universal quantifier, and following this analogy Prawitz [14] 
attempted the definition of a natural deduction system styled after the first order 
system. This naive approach, however, does not work, and in order to obtain nor- 
malization, Prawitz was forced to introduce more elaborated concepts and to for- 
mulate rules whose application depends on global constraints on the deduction (see 
Section 4.1). 

In the last few years, however, it has been discovered by several authors that the 
difficulty was in a too strict interpretation of the “Gentzen format’ and that several 
extensions [6, 3, 7, 11, 12] allow a good and ‘natural’ proof theory for modal logics. 

We will focus here on the proposal by the second author for the logic KD, whose 
main idea is a two-dimensional generalization of the notion of sequent. Instead of 
asserting provability (F) between two sequences of formulas, provability is asserted 
between two-dimensional sequences of formulas. 

Developed in [11] as a sequent calculus for classical KD, in [12] the approach 
is tailored to the intuitionistic framework, for which it is also given an equivalent 
natural deduction system. 

The goal of the present paper is to study the computational properties of the sys- 
tem in [12] and of several extensions to other logics, namely the positive fragments 
of K4, KT, and S4. 

To this aim, we introduce first a complete natural deduction system for each 
of the logics at hand. All the rules of these systems act only on the conclusions 
and/or the premises of the deductions they are applied to, no introduction rule has 
a premise containing the introduced connective (as it is case for rule OK, for in- 
stance), and the modal rules strictly match the standard rules for an universal quan- 
tifier, the side conditions on variables becoming a side condition on levels. 

One the main features of the approach is its scalability, the various logics differ- 
ring only in one parameter of the elimination rule for Cl. This allows both a compact 
treatment and the study of the ‘fine structure’ of the O modality (that is, how several 
sublogics interact to obtain a larger logic, in this case $4). 

By defining a suitable notion of reduction on the terms representing proofs (ex- 
tending standard G-reduction for A-terms), we obtain a natural computational in- 
terpretation for proofs, which is proved strongly normalizing and confluent. 

Finally, this local, two dimensional approach enforces remarkable global prop- 
erties on the resulting deductions. Though this is not the main subject of the paper, 
in Section 4.1 we will show that all deductions in our system for 54, in fact sat- 
isfy Prawitz's global requirements on proofs, thus giving an immediate forgetful 
translation of our system in Prawitz's one. 

It should be noted that Kripke's work on semantics of modalities [10] already 
contains Gentzen systems based on indexed formulas. Indexes represent paths in an 
abstract tree, whose obvious interpretation is the accessibility relation of a (Kripke) 
model. Formally, our approach is a particular case of Kripke's, where indexes are 
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just linearly ordered (the level structure). The intended interpretation, however, is 
not (and cannot be) a reflection of the accessibility relation into the syntax. The 
approach is motivated only by a ‘purist’ approach to modal proof-theory. It searches 
for local, geometrical properties of sequents in order to validate the introduction of 
modal quantifiers. 


2 SYSTEMS 


Before introducing formally the systems we will deal with, we briefly, and infor- 
mally recall the natural deduction approach taken in [12]. Let us denote formulas 


with lowercase Greek letters a, 8,0,..., and sequences of formulas with upper- 
case Gothic letters G, 8, ..., while e will be the empty sequence. An intuitionistic 
2-sequent is a two dimensional expression of the form 
G1 2 
G2 € 
(D |. 
DE: 
e o 


whose intended meaning is the formula 


NS > OA G2 5... a(A S D 0)...). 


Note that the formula c, the conclusion of the deduction, lies at a level, k, which 
is greater than, or equal to, the level of any assumption it depends on (any of the 
G;'s may be empty, of course). The propositional rules act over these two dimen- 
sional stuctures in the expected way, just ‘respecting the levels’. The rules for D, 
for instance, can be expressed as: 


Gi € 6, € Bi € 
Go € Go € Bo € 
22 E 22 E 22 E 
O,c T ST Gk ODT Bi o -E 
6i € 61, Bı E 
62 E 62, B2 E 
H : H 
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Formulas may change their level only by means of modal rules: 


61 € 
G2 € €i € 
. " Go € 
Gk E k H 
o 
€ o OT k d D£ 
Gi € 6 € 
Go € Go € 
F F 
e; Clo G; € 
€ 


Thus, the only way to introduce a modality on a formula occurrence ø at level 
k is that ø be the only formula present at that level. As a result of the rule, the in- 
troduced formula is lifted one level up. Vice versa, the elimination rule pushes a 
formula down one level (but there is no restriction on its premise). The levels thus 
represent in the calculus a notion of modal dependence: the conclusion c at level 
k modally depends on the assumptions at the same level. If there is not any such 
assumption, then we are allowed to assert that ø is necessary. 

As proved in [12], the given rules for 0 characterize the minimal normal modal 
logic K. ! In this paper we will extend this approach to a class of modal logics 
based on K (KT, K4, and S4), focusing on the computational aspects of the resulting 
proofs. 

Before going into the details of the systems, we adopt a more compact repre- 
sentation for 2-sequents. Instead of writing two-dimensional judgements, we will 
denote each formula ø at level k with o^ and write the judgment (2) as T H- o*, 


where 
Gi 
G2 


Gx 
will be seen as a multiset {ri 1... , T" }. The usefulness of this representation will 
be especially clear in Section 5. The reader should always bear in mind, however, 


that the indexes on formulas are only a metatheoretical notation for two dimen- 
sional structures. 


1More precisely, [12] introduces a system for the minimal deontic normal modal logic 
KD, that in absence of negation and © is equivalent to K. 
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2.1 Basic definitions 


Formally, formulas are built out of atoms (ranged over by p); compound formulas 
are obtained with the connectives: O (unary), A and > (binary). Any formula of the 
calculus will be marked with a level index, varying in Nt ; an indexed formula a of 
level i will be written c*. 

The following definition will introduce a calculus of terms and formulas, such 
that to any term M there will correspond a unique indexed formula ø+, called its 
type; this fact will be denoted with M : o*. For any indexed formula o? we as- 
sume the existence of a numberable set {x} , z5, ...} of variables of type o*. A set 
of assumptions (called sometimes also a basis, or a context) is a set l = (a2 
01 ,..., £i» : a}, where all the variables are different: if z^ : o* € T and z^ : 
T^ € I, then o* = 7^ (and in particular k = h). No variable name, hence, can ap- 
pear in a set of assumptions with two different levels (this is the level variable con- 
vention). For such a set of assumptions I’, define #I’ = max(k; | 27 : o7 ET}; 
#T = 0 when T is empty. Moreover, L'4* = (z^ : of €T | k < i}; the set 2* is 
defined analogously. Finally, for n € Z,T(n) = {x*t” : oft” | zë : of ET}. 


DEFINITION 1 [Calculus A] Terms and derivations are inductively defined by 
the following rules. 


rë : of 
To A r r 
M:o* N:7" M:oAr* M :o ^T* 
— AL ———M—— NE o0 ————M—— ^e, 
(M, NY :o AT" (fst M)* : a" (snd M)* : 7* 
T [x* : o*) T A 
M. 7 M:0 Dr" N:o e 
——————— — J 
(Az*:o*.M) : 0 D 7° (MN)* : 7" 


While we will always attach a level to a formula, in the exposition we will be 
more liberal for terms, writing both M : o* and M* : o*, since no confusion may 
arise in this way. 

If S is one of the systems introduced below, we write I ks M* : o* when in S 
there is a derivation: r 


Mi ot 
T Hs ot holds when there is a term M* such that Hs M’ : oè. 
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2.2 System XK 


The typed A-calculus A* K is obtained by adding to A the two rules we have already 
discussed: 


r r 


M? : o? 1 M? : Qo £ 
OL js#r oo D 

"m ungen( M? )?*! ; git! 
The informal meaning of these rules is thus the following. 


OZ If, from the hypotheses in I’, we derive that ø holds at knowledge level j, 
and nothing else holds at any level greater than or equal to 7, then Clo has to 
hold at level 7 — 1. 


OE If, from the hypothesis in I'; we derive that Og holds at knowledge level j, 
then c has to be true at the next level. 


This interpretation corresponds to the semantical property of the minimal nor- 
mal modal logic K: K is complete with respect to the set of Kripke frames whose 
accessibility relation is ‘irreflexive’, ‘asymmetric’ and ‘intransitive’. 

The characteristic axiom of K, Q(o D 8) D (Ga D OP), is derived in A^ K in 
the following way. 


z! : Da! y! : O(a D 8)! 
ungen(z*)? : a? ungen(y!? : 2 6? 
(ungen(y")'ungen(z^)^)' : 9 


DE 


OT 
Ay’: O(a D B)'.Ax’ :Ca’.gen((ungen(y’)’ungen(z’)”)?)' : O(a D 8) D (Ga > ng)! > 


2.3 System “KT 


The typed A-calculus A! KT is obtained from A K by extending the elimination 
rule for O. r 


M? : Do? ¢ 
ns » 
ungen( M?)* : o" "eoo HJ 


The informal meaning of such a rule is thus: 
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If, from the hypothesis in T, we derive that Oo holds at knowledge level j, 
then c has to hold at the next (7 4- 1) and at the current (7) level. 


We have a correspondence with the semantical property of KT: KT is complete 
with respect to the set of Kripke frames whose accessibility relation is ‘reflexive’, 
‘asymmetric’ and ‘intransitive’. 

By using the above rule we can prove the characteristic axiom of KT, Oa D a. 
r! : Do! 

OE 


ungen(z!)! : o! 


——— 5X1 
(Az! : Oa! .ungen(z!)!)! : Oa D a! 


2.4 System M K4 


The typed A-calculus A^ K4 is obtained from A* K by extending the elimination rule 
for O. 


D 


M? : Do? e 
—— DÉ p; 
ungen( M?)* : g* 2) 


The informal meaning of such a rule is: 


If, from the hypotheses in I’, we derive that Og holds at knowledge level 7, 
then c has to hold at any level greater than 7. 


We have a correspondence with the semantical property of K4: K4 is complete 
with respect to the set of frames whose accessibility relation ‘irreflexive’, ‘asym- 
metric’, and ‘transitive’. 

The characteristic K4 axiom, Da > OOa, is proved in the following way. 

r! : Oa! 

ungen(z!)? : o? 
gen(ungen(z!)?)? : Oa? 
gen(gen(ungen(z!)*)?)! : ana! 


————————————— 51 
(Az! : Oa! .gen(gen(ungen(z!)3)?)!)! : Qa > anat 
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2.5 System A*S4 


The typed A-calculus A*5S4 is obtained from A K by allowing both the A* KT and 
the A^ K4 O elimination rules. 


T 


M? : Oo! ne 
LLL "y 
ungen( M?)* : of ^ 


The informal meaning of such a rule is: 


If, from the hypotheses in I’, we derive that Oc holds at knowledge level j, 
then c has to hold at each level greater than or equal to 7. 


We have a correspondence with the semantical property of S4: S4 is complete 
with respect to the set of Kripke frames whose accessibility relation is ‘reflexive’, 
‘asymmetric’, and ‘transitive’. 

Since the characteristic axioms of 54 are those of K, KT, and K4, they can be 
proved in A^ S4. 


2.6 Implicit systems 


From the systems we have introduced, we may obtain 'implicit' calculi, forget- 
ting some of the (redundant) information. First, the implicitly typed A-calculi A* K, 
A* KT, A* K4, and A* S4 can be defined, by erasing from the terms the level and 
type decorations. We loose a 1-to-1 correspondence between proofs and terms, but 
we gain in readability. The proof terms corresponding to the characteristic axioms 
of the various logic in the implicit form are thus the following. 

l-3«  Ay.Ax.gen(ungen(y)ungen(z)) : O(a D 8) D (Ga D ug)! 

Farr Az.ungen(z) : da Da 

Fas g4 Az.gen(gen(ungen(z))) : Oœ 2 aia! 

Second, we may forget A-terms altogether; the resulting systems are less verbose 
and we will often resort to them in the sequel. Figure 1 summarizes these systems. 


3 HILBERT-STYLE SYSTEMS, A COMPARISON 


In this section we show that provability in the calculi A! K, A! KT, A* K4, and A*.S4 
is equivalent to provability in the positive fragments of the Hilbert-style formula- 
tion of the corresponding logics. Let L be the positive fragment of one of the logics 
in {K, KT, K4, S4}. 

For the purpose of this section, we omit the term labelling for the derivations in 
the calculus; moreover, if T = {o7,...,0%}, then I? = (o1,...,0,]. 
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Dg? 
— pet 
o 


tk=j+lin\'K; 


k € (j, 3 +1} indo KT; 
k>j+1indAt KA; k > jinA' SA. 


FIGURE 1. Term-free systems. 
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THEOREM 2 
Hro <> Furo! 


The proof of this fact breaks down into the following lemmas. 
LEMMA 3 Fro => Fy at. 


Proof. By induction on the lenght of the derivation. Propositional axioms and modus 
ponens are trivial. The proofs in A^ L of the modal axioms of L have been given in 
the previous section. For the inference rule (Gen), suppose we proved Fz ø. By 
induction hypothesis we have Fe; ot}. Replacing now any index level k in this 
deduction with k + 1, we obtain a deduction of Fe; a°. An application of OZ 
gives tye, Dol. E 


The following ‘translation function’ corresponds to the intended meaning of our 
judgments, as informally stated in the Introduction. 


DEFINITION 4 

e TTFa] - AI^ 5a 

e Tt att] = A(F*?)* 5 a(T[P2?(-1) F o7]) 
where, NØ = T (any tautology). 
LEMMA 5 THa => +, TPF a’). 


Proof. By trivial, and long, induction on the deduction T ye, a. a 


4 INTERMEZZO 


4.1 On the geometry of proofs 


One of the most interesting properties of the systems we have proposed (or better of 
the level machinery), is the geometry they enforce on the deductions. In particular, 
we show in this section that all (not necessary normal) deductions involving a OZ 
have a very remarkable global structure. 

Before showing the technical fact we have in mind, however, we recall a dif- 
ferent natural deduction system for S4, namely the ‘third version’ of the calculus 
discussed in Prawitz's classical monograph [14, Chapter VI, pag. 79]. The propo- 
sitional rules of that system can be obtained from those of Definition 1 by simply 
erasing the terms and the level indexes (thus obtaining the usual rules for intuition- 
istic propositional logic). As for the rules for O, the elimination is the same as ours 
with the omission of terms and levels, while the introduction rule (which we will 
call OZ>p) is rather elaborate, in order to ensure normalization and the subformula 
property. An application of CIZ» has the form: 
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Dı D, 
[Or e. Orn] 
D 
oT 
— ULp 
Oo 
where 07;,..., C7, are all the open assumptions of D, and no open assumption in 


the deductions D; (of conclusion 017;) is bound in D.? In other words, an applica- 
tion of OZ>p is obtained by taking a deduction 


Or; eee OT, 
D 


g 


where all open assumptions (if any) are boxed, and plugging into these assumptions 
arbitrary derivations with the right conclusion. Rule CZ» is far from being ‘natu- 
ral': it allows normalization, but at the price of a strong globally stated constraint 
on its application. ? 

Coming back to our system, we can show that all applications of OZ, in fact, 
comply with Prawitz’s requirement. In order to give a clearer picture of the involved 
deductions we will dispense from terms. 


THEOREM 6 Let € be a deduction of T F «4 o", with k > #T. Then E has the 
form 


I4 I 
Di D, 
[Or — ... Orbe] 
D 
ok 
where: (i) any I’; is non empty; (ii) the union of the l';'s is U'; (iii) any Dj is a de- 
duction of T ; Fye54 Dr ; (iv) D is a deduction of Or; yee, Ur Eug, o^; (v) 


for any j, hj < k. 


Proof. If I is empty, then the statement is vacuous (n = 0 and D = £E). Otherwise, 
the following informal algorithm will produce the required data. 

Let r be any path in £ from the conclusion o* to an occurrence of an open as- 
sumption p € I. In r there must be an application of OE, since k > #T and DOE 


*Prawitz’s original definition is formulated via the concept of essentially modal formula. 
It is not difficult to see that our formulation of OZp, though not equivalent (it allows less 
deductions), it is still complete for the ^, D, O fragment. 

3In order to introduce O we must look into the proof tree. 
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is the only rule allowing its conclusion to be at a higher level of its premise. Pick 
the first application (starting from the conclusion up) of O€ in which the level h 
of the premise (say, Or") is strictly less than k. We choose this Or” as Or)"; D, 
is the subtree of € rooted at Or”; T; is the set of all open assumptions of the de- 
duction D, with conclusion Or”. Clearly p € T',, thus establishing (i) and (iii) for 
j = 1; in order to eventually establish (iv), we have to guarantee that no assump- 
tion in I'4 can be discharged below Or}. However, this is obvious, because any 
formula appearing in the path r below Or" have a level | > h (by construction), 
while 3:T, < h, (see Proposition 9), thus forbidding the application of a D € at 
level | below Or". 
The previous procedure can be repeated until there are no more occurrences of 
assumptions in I" not already allocated to some I';. 
When the algorithm terminates, (ii) is obvious, and (iv) follows by construction. 
a 


Writing Hp for provability in Prawitz's system and denoting with ( )? the ob- 
vious function stripping the level indexes out of a formula (deduction, etc.), the 
following is straightforward. 


COROLLARY 7 If D is a deduction of T- Fy«g4 o", then D? is a deduction of 
I’ -» o. 


Not any proof in Fp can be decorated with levels to become a proof in our sys- 
tem, as the following deduction shows: 


Proofs in our system, however, are still enough for completeness and, moreover, 
seem to have a more direct interpretation as travelling in a Kripke model (cfr. [3]). 

Many consequences can be drawn from Corollary 7. Since any reduction step 
(see Section 6) in our system becomes also a reduction step inp under the (_)? 
translation, we immediately obtain normalization for AÉ S4 from the normalization 
theorem for p. We will give a different proof of strong normalization for A* S4 in 
Section 6.2 

Moreover, by adding to the usual clauses that, for any k, o* has to be considered 
a subformula of c^, we also obtain the following subformula property. 


THEOREM 8 (Subformula principle) Any formula occurring in a normal deduc- 
tion of T tyeg4 o* is a subformula of o* or of some formula in Y. 


Proof.[Sketch] It is enough to prove the theorem for Fp, which can be obtained in 
the same way as the corresponding result for intuitionistic logic [14, Theorem 2, p. 
53; cf. p. 80]. a 
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Starting from a purely structural extension of the syntax (the levels) we have thus 
recovered the global, elaborated constraint on the geometry of the deductions that 
other systems are forced to require from the beginning. 


4.2 A proof theoretical request... fulfilled 


One of the principal features of a good modal proof theory is its capability to suit 
a class of different modal logics without changing the way modal connectives are 
manipulated (e.g. introduced and eliminated). The differentiation between logics is 
delegated instead to the way (general) formulas are manipulated. 

A clear discussion on this topics (in the case of modal systems) may be found 
in recent work of Wansing [17] and Došen [6]. This point of view (that [17] calls 
Dosen principle, but that should be traced back to Gentzen, for the differentiation 
of intuitionistic from classical logic) may be stated as: 


The rules for the logical operations are never changed: all changes are 
made in the structural rules. [6] 


At first look our calculi violate this principle: different systems are obtained with 
different elimination rules. 

One could attempt a *weak' defense, arguing that all the proposed elimination 
rules are equal in ‘spirit’, since they differ only in level decorations. But this would 
miss the point. Indeed, we totally agree with DoSen principle and we claim that our 
approach does not violate it. .. when observed in the right context. The distinc- 
tion between logical and structural rules, in fact, is only relevant in sequent calculi, 
and it is thus there that the challenge posed by DoSen principle has to be attacked. 
The natural deduction system A‘ K has an associated 2-sequent calculus (see [12]), 
whose left and right O introduction rules are the following: 


61 E O1 € 

6» € G2 € 

. E: TEE 

6.1 € Gk € 
O,,0 T ok € o -n 

61 € 61 € 

65 € 62 € 

H o: 220 E 

6;,-1, Oe € Gk Oo 


Sequent calculi for KT and K4 are now obtained by adding the structural rules 
(on levels) Ft (KT) and H} (T4), respectively: 
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61 € 61 € 

G» E 62 E 

Db: 22 

G; E 6; E 
E 2 LT (KT) ~~ 22. LL (K4) 

6, € 6, E 

G2 € G2 E 

F: F 

e. o Gk E 

e" e" 

€ Oo 


where n > 1 and 


Ee" = : p> ntimes 
€ 


The calculus for S4 is obtained by adding to the basic logical rules both Ft (KT) 
and H4 (T4). Thus, in a sequent setting, the four systems are obtained by means of 
structural rules. 

The structural rules of sequent calculi, on the other hand, should not be explicit 
in the corresponding natural deduction formulation. Intuitionistic contractions and 
weakenings, for instance, are realized by the convention on the discharging (meta-) 
operation. This is why our calculi do not contain rules like 


T 


g? 


Following the traditional natural deduction approach, such rules have been embed- 
ded into the (logical) DE rules. 


5 WORKING WITH A*S4 PROOFS 


In this section F will stand for P 4754, whose rules are recalled in Figure 2. 
PROPOSITION 9 Let T + M*: oè. Then i > #I. 
Proof. Inspection of the rules. a 


DEFINITION 10 A derivation T + M? : o* is concluded ifi = 1 and #T < 1. 
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r:o" 
r A r r 
M:o*  N:r* M:oAr* M:o^r* P 
NC, 
(M, N):o ^T^ (fst M) : o" (snd M) : 7* 
T [z:0*] r A 
M:r* M:aDr® N:o" 
DT E DE 
(Az* : o*.M)* :0 5 r^ (MN) : 7 
r r 
M : o? M : Qo? - 
re ungen(M)* : o" "21 


FIGURE 2. Rules for A£ S4 


THEOREM 11 Any non concluded derivation can be extended to a concluded one. 


Proof. Given a non concluded T H- o*, first replace any o^ € T with: 


Oo! £ 
—— 0O 
g” 


At this point apply k — 1 times the rule OZ to the conclusion, obtaining a formula 
of level 1. = 


The following technical lemmas aim to show, given a deduction for - c^, how 
to obtain a deduction for the same conclusion at a different level, possibly changing 
also some of the levels in L'. We will see in Section 6 that this operation is essential 
for the computational interpretation of proofs. 

At first sight the operations on proof trees given in this section may appear heavy 
and difficult to grasp. Notice, however, that the problem we tackle is essentially the 
same as the one arising in the first order natural deduction system NJ, when, given 
a deduction for T - a(x) and a term t, we want to obtain a deduction for I" + a(t). 
Many texts say, in this case, that all it is needed is the uniform substitution of ¢ for 
z in the proof tree. A careful analysis of the process involved, however, shows that 
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care must be taken in performing this operation; in particular, it is needed a strong 
discipline in the use of variables. A very clear account of the issues involved is 
given in [16, Volume II, Ch. 10]. Levels in A^S4 play a very similar role to vari- 
ables of NJ; the following lemmas can then be seen as expressing concepts like 
‘renaming’, ‘term substitution’ and their properties. 


LEMMA 12 Letk » 0.Tl'-o? => Vi€j TSi P2 Gk) b att 
Proof. 


e o) Fal > aitt € T2448); then o? ** H o3 **; 


T ai T<? P2*(44), o? ** 
I.H. : 
e if 7 then l 
p? Bitk 
a > P a D Bitk 


P<? P2*(4n) 


L.H. 
e if t then : 
a qitk 
Oa-1 Do3-1*k 
Tr 
e if : | we have two subcases: 
Do? 
ar 


(i < 7) in this case we have: 
P<? T2*(.x) 
I.H. 
O07? ** 
artk 
(j <i € r) in this case, without applying the induction hypothesis and 


noting that l'2* = Ø by Proposition 9, we have: 
Pst T2*0) 


Do? 
artk 
i 
LEMMA 13 Let 22.TFo9? => Wi(2<i< j)T<*,T2*(-1)F o^ 
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Proof. 


e o) tal > o?! €T2*-1) then o?^! + 977 


T a P<? [24(~-1),a77} 
I.H. : 
e if t. then : 
p p 
aD 3 a D gi-! 
r<* [2*(-1) 
I.H. 
e if t". then : 
as oj! 
[97-1 Oas-2 
D 
e if : | we have two subcases: 
Do? 
ar 


(j 22,0 € j) in this case we have: 
T4 T2*(-1) 
I.H. 
m 
P 
(j <i € r) inthis case, without applying the induction hypothesis and 


noting that TZ! = Ø by Proposition 9, we have: 
<i, T2*(4) 


Do? 
a7 


The operations on deductions given in the proof of the two previous lemmas is ex- 
plicitated on proof terms by the following definition. 


DEFINITION 14 (Level substitution) LetI' - M : o”; letn E€ (C113UN 122 
if v > 2, and n € N,i > 1 otherwise. We inductively define the level substitution 


[n]; M (read: increment by n any level greater than or equal to 1). 
[fu <2: 


[n]; M" = M” 
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Ifv 2 i: 
[n]iz" = z+” 
[n]; (M, N)* = ([n; M, [n]; Ny" +" 
[n];(fst M)" = (fst [n]; M)"*^ 
[n]; (snd M)" = (snd [n]; M)"*" 
[n];(Az" :o* .M)" = (Az"*" :; "*".[n]; M)"*^ 
[n] (M.N)" = ([n]; M [n]: N)” +” 
[n];gen(M)" = gen([n];M)"*" 
[n];ungen(M)" = ungen([n];M)"*" 


Whenever we write a level substitution [n]; M, we assume that all the constraints 
of the definition are satisfied (in particular those on n). Note that [0]; M" = M". 
Lemmas 12 and 13 can then be reformulated as follows. 


LEMMA15TF-M:o) => Wi<j, TS5T2*Ga)F [-k]; M : o?** 


LEMMA 16 Letj >2,then TE M:a? => VWi,2<i<j, T< ržł({-1)} 
[-1];M :oj71 


We can picture the effect of the level substitution on proof trees as follows, ob- 
serving first that Theorem 6 can be generalized. 


THEOREM 17 Let € bea deduction of V + o*, and leti < k. Then € has the form 


ri ri 
Dı Dn 
[ary ev D^] r2 
D 
ct 


where: (i) any rs is non empty; (ii) the union of the rs ’s is P<"; (iii) any Dj is 
a deduction of rs H or; ; (iv) D is a deduction of Or! joo Uri, T2 EF of; 
(v) for any j, hj < 1. 

Proof. A simple variant of the proof of Theorem 6, or, for the reader preferring 


recursion to iteration, by aroutine induction on the lenght of €, similarly to Lemmas 
12 and 13. BH 
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In the same hypotheses of the theorem, now, the deduction [m];£ is 


rt re 
D Dn 
[Ore Or) Tim) 
[m];D 
E 


where all the formulas appearing in D are affected by the level modification, while 
all the D;'s deductions are left unchanged. 
Next lemma shows how level substitution interacts with term substitution. 


LEMMA 18 ([n]; M)[[n]; N/z**"] = [n] (M[N/z*]) 


Proof. Just note that if z* € FV(M) then any subterm of M containing x* has a 
level h > k.If z* g FV(M) then the variable-level-convention ensures the thesis. 
a 


We state, next, a property of nested level substitutions. 
LEMMA 19 (Level substitution lemma) Let k > 1, n > —1, and j > 0. 
(i) Ifi < k — 1, then foranyl > k, [n]; [j — Ue M' = [j — 1]ees[n]; M". 
(ii) Ifk-1« i € k—-1-j, thenforanyl > k, [n;[j - 1], M! = [j—1 +n] M*. 


6 COMPUTATIONS 


In the previous section a class of modal typed A-calculi has been introduced. It is 
now necessary to study their computational behavior, by introducing suitable no- 
tions of reduction extending the usual typed 8-reduction and corresponding to the 
process of proof normalization. We will be able to show that this reduction is con- 
fluent (or Church-Rosser) and strongly normalizing (or nótherian). 


DEFINITION 20 (Contraction) ((Az? :03.M?)? NIYI b MINI /z?] 
(fst (MÍ, NIYI) > M? 
(snd (M3, NIY) p. NI 


ungen(gen(M^)*-1)*71*3 © [j — 1]. M* (forj € N). 
The compatible closure of > (or one-step reduction) is denoted with —; its tran- 


sitive closure is +$, while — (the reduction relation) is the transitive and reflexive 
closure. 
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The following theorem expresses the correctness of Definition 20 with respect 
to types. 


THEOREM 21 IfT + M? : à? and M? — NÄ, thenT' + Ni: oi. 


Proof. The only non standard case is when the reduction is a modal contraction. 
From M = ungen(gen(P*)*—-1)*-!+3 we have T - P* : Go* and k > #1. The 
level substitution in [j — 1] x. PF, then, does not affect the free variables of P and 
Lemmas 15 and 16 give the thesis. E 


Reduction and level substitution match well: the former is preserved by the lat- 
ter, as the following results will show. 


LEMMA 22 If M^ > N°, then for any i € hand any n, [n; M^ > [n]; N^. 
LEMMA 23 If M^ — N” then for any.i € h and any n, [(n; M^ — [n]; N^. 
THEOREM 24 If M” — N", then for any i < h and any n, [n]; M^ — [n]; N^. 


Proof. By induction on the lenght of the reduction, using Lemma 23. B 


6.1 Confluence 


We prove the Church-Rosser property for —, using Tait's technique as formulated 
in [9] or [1], here adapted to modal terms. We define first a new auxiliary notion of 
reduction ( 23 ), corresponding to the (possible) parallel contraction of several non 
overlapping redexes; note that — is not transitive. Then, — is shown to respect 
both term and level substitution; from this we prove that * 2 satisfy the diamond 
property’, Lemma 28. Since — is obviously the transitive and reflexive closure of 
=} , we have a standard proof that — is Church-Rosser. 

In the following proofs we will give only the main (modal) cases. For the others, 
which are standard, the reader is referred to the literature. 


DEFINITION 25 The one-step parallel reduction is defined by the following clauses. 


i. M:a* 3 M:o*; 
ii. If M:o* = M':o*and N : B* 3 N' : pF, then 
(M, N)*:a^B* = (M', N): an fih; 
iii. IfM : B* = M' : pF, then 
(Az®:a*.M)*:a DBF = (Ax*:o*.M")* :a D p"; 
iv. IfM : a^ = M':af, then 
gen( M)*-1 : Do^-1 = gen(M')*-} : Da*-!. 
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v. fM :a^B* 3 M':aN pF, then 
(fst M)* : a* = (fst M")* : af and (snd M)* : B* = (snd M")* : BF; 


vi. If M: a* 3 M':atand N :a D B® 3 N':aD p, then 
(NM)* : BF 3 (N'M'Y : BF; 


vii. If M :Qa* 3 M' : Ook, then 
ungen( M)" :a” = ungen( M')" : a”; 


viit. IfM : ak z3 M' : aœ" and N : 8* = N' : B*, then 
(fst (M, N)*)* : o* = M':a* and(snd(M,N)*)* : BF 3 N': BF; 


ix. If M : B* = M': BF andN:a* 3 N': aà, then 
(Azt :o*M)*N*)* : g* = M'[N'/x*] : Bt, 


rz. If M: o* 3 M':af, then 
ungen(gen(.M)*71)*71*3 : o9k-1*3 = [7 — 1, M' : ak-147, 


As usual, a derivation of M : 8^ = M!' : f^ is a sequence of reductions 
Mo: ^ 3 Mi:^,..., Mai: 8" 2 Mn: B”, M : 8^ 3 M"' : Bh, such 
that any reduction of the sequence is either an axiom (clause 7) or it is obtained by 
a clause from some previous reduction(s). 


LEMMA 26 If N : o* 3 N':o*, M:f^ 3 M': f^, and x* : a*, then 
M[N/z*]: B^ = M'[N'/a*] : 8^. 


Proof. By induction on the lenght of the derivation of M : 9^ 3 M' : 8^. 
Base case: M — M'. This case is established by a structural induction on M. 
Induction step: by cases on the last clause used to derive M : 9^ 33 M' : pP. 

We give some cases, labelled by the clause used. 


(iv) gen(M)^ : Qa’ = gen(M")^ : Oa", where M : o^ 3 M': 
a^*1, By induction hypothesis, M[N/z*]: o^*! 3 M'[N'/z*] : o^, 
from which the thesis, since gen(M)"[N/x*] = gen(M[N/z*])^ : ao^ 
and similarly for M”. 


(vii) similarly to the previous case. 
(x) ungen(gen(M)^71)^71*i ; gh71*t3. = [j 1] M' : a^71*5, where 
M:o^ 3 M':a*. 
The thesis is now given by the following diagram: 


ungen(gen( M)^- 1)^- V [N/a*] : y^h7 L3 =; (B — AMON I sah Vti 
ungen(gen( M)^"![N/z^])^ 13 : 9^7 1*5 -— [j — 1a (M'[N'/z"]) : aH 
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where the = in the bottom row is given by induction hypothesis and clause 

(x), while equality (*) holds as either z^ does not occur in M, or k < h. 

a 
LEMMA 27 If M : a* = M' :o*, then for any i and any n 
[n];M : o^*^ 3 [n]; M' : o**? 

Proof. If > k the statement is trivial; otherwise we proceed by induction on the 
length of the derivation of M : o^ 3 M':o*. 


Basis: If M = M' the thesis is trivial. 
Induction step: we give only the modal cases, labelled by the last clause used. 


(vii) M = ungen(N*—7)* : af and M' = ungen(N'^—7)* : a, 


with NETI : gF-3. = N'k-3 ; gk-5. 
The thesis is given by the following diagram: 


[n]:ungen(N*-3)* :o*t" — -3  [n];ungen(N'^-3)* : att” 


by IH l 
ungen([n]; N^77)**^ : o**^  —  ungen([n|; N'^-7)**7 ; qktn 


(x) M = ungen(gen(N^-7*1)*77)* : af and M' = [j — 1], ;41N'^77 : 
Qa’, 
with N*-3*1 ; gk-3*1 — WN'k-j*1 : œk-i+1 which by IH implies that 
Q) [n] NETI : [n;a^ ?*! — [n]; N'77*! : inja i+ 


We have two cases. If ? < k — j the thesis is given by the following diagram: 


[n]iungen(gen(N^77*1)k*7)* : atte = [nl[j — 1], ;4i N+! : o 
(*) 
(**) 
ungen(gen([n]; N77 *! 7; *")*t» : oct al - 1]. aen [n]; N'it aftr 
where (*) is Lemma 19(1), and (**) follows by (6.1) and clause z. Ifi > k—3 
the following diagram establishes the thesis: 
? , 
[n]iungen(gen(N*77*!)*73)* : att  — [nkilj — 1] 544 N' 57 : att” 
| € 
. . (** . 
ungen(gen(N*7*1)573)**^ akte -— [j-1+ n]k-j441N'573* ; ot 


where (*) is Lemma 19(ii), and (**) follows by 6.1 and clause z. 
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LEMMA 28 (Diamond Property) Suppose M : a^ 3 M':a* andM:a* 3 
M" : a. Then we can find a term M" : o* such that 


P 
M" : ak 


M':ak 


Proof. Let n’ and n" be the lenght of the derivations r' of M:a* = M': a 
andr” of M : a* = M" : o*. The thesis is extablished by induction on n' + n". 


We give only the modal cases. We write M : a^ -3 M"' : af to denote that 
M :a* 3 M':afF is established with a deduction of lenght n. 


e Both r’ andr” end with clause (iv). 


gen(P)* 


AZo bru 


gen(P")* : Oa* gen(P’)* : Oak 
gen(P"^)* : Da 


where 


nus 


P" . a^ *! 
is obtained by I.H. 


e Both r' and r" end with clause (vii). As the previous case. 


e Both r’ andr” end with clause (x). 


236 SIMONE MARTINI AND ANDREA MASINI 


ungen(gen(.P)*—7) 


`< 


c = lcs! : at 
SN A 


[ - 1k-;aP":a 


[j 7 l]k-j+1 P" : 


Where 
Pd C 
xw . Zz 
is obtained by I.H. and 
[ - Veja Ps a* 2 [j - 1k; P" : o & [j - 1 ja P" : 0 


holds by Lemma 27. 


e Last clause of r’ is (x) and last clause of r” is (vii). 


ungen(gen(P)*—7)* : o 


` 


c E 
— AA 


-Ij P ia 


ungen(Q*-7)* : a 


where 


. , n"—] , , 
gen(P)^7 : na^ 4 Qk): na? 
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Observe now that Q^? = gen(P")*—7, with 
k—3-F1 <n" i k~j+1 
P:a% It! = P" : Qh st 


since no clause for =} may remove a toplevel gen. By induction hypothesis 
we obtain 


P : ak-s4+1 
P" : ot P':gk-1 


P". o^-7*1 


Lemma 27 allows to conclude. 


a 
THEOREM 29 (Church-Rosser) The — relation is confluent. 
Proof. M > N iff M = Mo 3 ... 3 Mn = N with n > 0. The usual 
Church-Rosser diamond can now be closed by using the previous lemma. a 


6.2 Strong normalization 


We reduce the problem of strong normalization for 4°54 to the strong normaliza- 
tion for the system of next definition. 


DEFINITION 30 (Calculus A54) Terms and derivations are inductively defined by 
the following rules. 


DA r r 


M:o* N:r* M:oAr* M:o^r* 


——————— AT ————— AE — NE, 
(M, N):o nT" (fst M) : o" (snd M) : 7* 
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T [r:6*] r A 

M :7# M:0D 7r" N : o" 
— Go — oE 
Mt.M: ao 27 MN:T 

r T 

M : o M : Oo? 
Mime oL PESO Tag e HH 


Contractions (and reduction) for A54 are defined is the obvious way (note that 
modal contractions do not affect the terms, but only the levels in the formulas). 

Derivations (i.e. terms) in A*S4 can be obviously translated into A54, via a 
straightforward forgetting translation ( )?, establishing easily the following propo- 
sitions. 


PROPOSITION 31 T Fa, M : oF I? Fasa M? :c*. 


PROPOSITION 32 Let M : o* > M' : o* bea propositional reduction in M S4; 
then M? : of > M” : o* in ASA, using the same contraction. 


Let A be the intuitionistic lambda calculus with implication and conjunction. The 
types of AS4 can be interpreted into the types of A by means of the following trans- 
lation. 


(b) = p 
(a> 8") = (a^) D (8°) 
(aA B*)* = (a*)* ^(85)* 
(Qa*)* — (aoF*!)* 


PROPOSITION 33 T Hysa M : of => I* +) M : (o*)* 


Proof. Induction on the derivation, the modal rules becoming vacuous steps in A. 
H 


It is well known (e.g. [2]) that À enjoys strong normalization. It is now easy to 
obtain as a corollary the same result for A£ S4. 


THEOREM 34 A'S4 enjoys strong normalization. 


Proof. Let T Fesą M : o* and suppose there is an infinite reduction sequence 
starting from M. Observe now that this sequence cannot be composed (from some 
point on) only of modal contractions, since a modal contraction strictly decreases 
the number of nodes of the proof tree. Hence the reduction sequence contains an 
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infinite number of propositional reductions (that is, contractions of non modal re- 
dexes). But this is impossibile, because these propositional reductions would also 
be reductions of M? (by Proposition 32), which is typeable in A, by the previous 
propositions, while A enjoys strong normalization. a 


6.3 Computations in subsystems of MS4 


While all the results of the previous sections are stated for the stronger system A454, 
a careful inspection of the proofs shows that all the statements which apply to a 
given system still hold. 


THEOREM 35 Let AFL one of °K, M KT, M KA, or M S4.(i) Reduction in X L 
is confluent.(ii) Reduction in A L enjoys strong normalization. 


7 RELATED WORK 


In this brief section we relate our proposal to some of the recent work on natural 
deduction for modal logics. This is not the place for a detailed proof theoretical 
analysis of the different systems and, for this reason, we will discuss only the cal- 
culi allowing a computational interpretation of proofs, the exception being the work 
presented next, which we take into account for its similarity with our system. For 
more detailed analysis of the proof theory of modal logic, we refer to [13] and [17]. 


7.1 The 'constructive' system of Benevides and Maibaum 


Benevides and Maibaum [3], starting from a semantical intuition on Hintikka sys- 
tems, propose a set of natural deduction rules for the D-based systems essentially 
analogous to our proposal. However, their semantical insight seems to hinder a full 
mastering of the level machinery. They introduce in their system redundant rules 
for the interaction between O and the propositional connectives (see the rules R2 of 
their Section 3.2.2), which are derivable in the rest of the system. From a proof the- 
oretical point of view these rules are hardly justified, since they do not obey to the 
tenet: “For each connective, (only) one introduction and (only) one elimination". 
For this reason, it seems impossible to relate these systems to any clean sequent 
calculus for the same logics (see, contra, our Section 4.2) 

They prove the equivalence of their systems to the classical Hilbert style formu- 
lations. However, no notion of normalization of proofs is introduced. It is our belief 
that any study of the constructive content of a logical system cannot dispense from 
the study of proof normalization. If one does not tackle normalization, why not us- 
ing the simple, naive approaches outlined in the Introduction? 
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7.2 The intuitionistic calculus of Bierman, Meré and de Paiva 


Bierman et al. [4] present an approach to the C) modality essentially based on a vari- 
ation of Prawitz proposal. The problematic OZ rule has the form 


M IL. [xı :O01,...,2n : Do] 


Mı : Qa; ... Ma4:Uo4 N:T 
box N with Mj,..., M, for 271,...,2n : OT 


For this system, the authors introduce several reduction rules corresponding to a 
categorical model and show its equivalence with the standard sequent calculus for 
S4. 

Tt is clear that this system has some advantages over Prawitz’s, since the rule OZ 
is not as global as OZp (see Section 4.1) and allows a clearer reduction step. How- 
ever, the approach seems specifically designed for S4, since it is not clear how the 
rule would scale to the normal subsystems of $4. Moreover, many of the critiques 
of the Introduction still apply to the proposed rule. In particular, a good introduc- 
tion rule should construct its conclusion from all its premises and possibly some 
assumptions, a requirement clearly violated by the proposed OZ. 

Finally, [15] shows a normal deduction in the positive fragment of the calculus 
in [4] which does not satisfy the subformula property. It is then necessary to add 
suitable permutative reductions, which are not needed in our (and Prawitz's) ap- 
proach. 


OT 


7.3 The labelled natural deduction of Gabbay and de Queiroz 


A completely different approach is the one proposed by Gabbay and de Queiroz in a 
sequel of papers, e.g. [7, 8], as an attempt to give a general theory of natural deduc- 
tion extending the Curry-Howard isomorphism to a large class of logics (classical, 
linear, relevant, modal). As for modalities, 


it offers a deductive (as opposed to model theoretic) account of the con- 
nections between modal logic and it propositional counterpart when 
world-variables are introduced in the functional calculus of the labels 
(i.e. when a little of the semantics is brought to the syntax, so to speak). 
[8] 

The calculus is a kind of second order system, where second order objects, which 
syntactically are only variables, correspond to possible worlds. The basic rules for 
introduction and elimination of D are: 

T [W : 4] T A 


Mo M : Qc T:U 
_ Oo — 0 
AW.M : ac CAT R(M,T) : o(T) 
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where U stands for an intended collection of ‘worlds’. 

There is an certain analogy between our levels and the world-variables, which 
is especially clear in the case of the logic K. Moving to stronger logics, however, 
obscures this analogy. The characteristic axiom of KT, for instance, is not provable 
as is in their system; they can only prove its modal closure, O(Oø D ø) (it seems 
they would need a kind of second order constants to prove Clo. D o). In the case 
of K4, the proofs of the characteristic axioms are remarkably different in the two 
systems; while their proof consists of just a OZ rule, in our case we have a OE 
followed by two OZ. 
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GABBAY-STYLE CALCULI 


It is well known that there are logics, e.g. classical first-order logic with n variables, 
Ln, that do not have strongly sound and complete Hilbert-style inference systems. 
However, some of the logics of the above kind have weakly sound and complete 
Gabbay-style inference systems, cf. Gabbay [6], Venema [13], [14], and Simon 
[12]. 

In this paper, we will consider £n as a multi-modal logic and give weakly sound 
and complete Gabbay-style inference systems for two versions. These complete- 
ness results follow from representation theorems for the corresponding classes of 
algebras. 


] INTRODUCTION 


Usually logics are defined in a semantical way. That is, one gives a logic by defin- 
ing its language (the set of formulae), the class of models, and the validity relation 
(between models and formulae), or the semantical consequence relation (between 
sets of formulae and formulae). In addition, sometimes we have an interpretation 
(or meaning) function as well. 

The question of whether a logic has a sound and complete inference system, i.e. 
whether we can grasp the above semantical relations by pure syntactical means, 
naturally arises. 

There is a class of logics that have strongly sound and complete inference sys- 
tems, i.e. where the semantical consequence relation can be simulated by a syntacti- 
cal relation. For instance, classical propositional logic and the modal logic 
S95 have strongly sound and complete Hilbert-style inference systems, cf., e.g. 
Andréka, Németi & Sain [4]. Moreover, the calculi referred to above have the fol- 
lowing property: the rules of inference can be applied without any restriction (such 
a restriction could be done, e.g. on the shape of the formulae occurring in the rule). 
These calculi will be called Hilbert-style in this paper, cf. Definition 2. 

However, there are very interesting logics that do not have strongly sound and 
complete Hilbert-style inference systems, e.g. classical first-order logic with n vari- 
ables, Cn, does not have one, cf., e.g. Andréka [1], Németi [9], and [4]. But they 
may have weakly sound and complete inference systems, i.e. validity may be imi- 
tated. (In passing we note that it is an open problem whether £,, has a weakly sound 
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and complete Hilbert-style calculus; we conjecture, it does not have one.) Beside 
Hilbert-style calculi we may consider inference systems in which some inference 
rules can be applied only if certain ("reasonable") condition is met. These calculi 
will be called Gabbay-style, cf. Definition 2. The reason for this name is that in 

Gabbay [6] there is a complete inference system of this kind for a previously in- 
complete logic, and with this Gabbay initiated a series of new completeness theo- 
rems for various logics. In passing we note that Martin Abadi's inference system 
for first-order temporal logic is also Gabbay-style, cf. Sain [10] for latest results 
and for references. 

In this paper, we will prove that various versions of first-order logic with finitely 
many variables have sound and complete Gabbay-style inference systems. In par- 
ticular, we will give calculi for the so-called ordinary and restricted versions of first- 
order logic with n variables. For these logics Yde Venema gave Gabbay-style (or 
unorthodox) calculus in Venema [13], [14] and [15]. However, he uses the differ- 
ence operator D that is defined by equality. Our complete calculi below are defined 
without using D, partially answering question (1) on p.111 in [13]. For a similar 
completeness result for first-order logic without equality see [8]. 


2 DEFINITIONS AND MAIN RESULTS 


First of all, we give the precise definitions of the logics we are dealing with, cf. 
Henkin, Monk & Tarski [7]. 


DEFINITION 1 Ordinary first-order logic with n variables with equality is de- 
fined as the ordered triple 


L3 SS (Fmlc=,Modc=, c=) 
for which the following conditions hold. 


def , 
1. Let V = {vo,...,Un—1} be the set of variables. Let P denote the set of 
atomic formulae, i.e. 


def . , 
P'È fri(o4,,..., 05, ,) i € I&jo,.--,jn-1 < n] 


for some set I; the symbols r; (i € I) are called relation symbols. Then the 
set Fml c= of formulae is the smallest set H satisfying: 


e PCH 
e v; =v; € H for everyi,j En 
e p, E H,i En > (pry), w, wip € H. 
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Sometimes we will use the notation Jip instead of 3vi« and Ej; instead of 
v; = vj. By A(Lio,...im})M Uo... im) € n, we abbreviate 3i, . ..3;,, 9." 
By the set of connectives of Cz, Cn(£z), we mean the set (^, 5,3;, Ei; : 
ij En}. 


2. The class Modc= of models is defined by 


Modc= © ((M,R);:MZe,R;C"M (icI)). 
If M = (M, R;);c; € Modc=, then M is called the universe of M. 


3. Let M = (M, Ri); € Modc=,k € "M and o € Fmlc=. We define the 
ternary relation (M,k) |= «v (sometimes also denoted by M E ¢[k]) by 
induction on the complexity of 9: 


def . 
e (Mt, k) H TilVjos ee -3 Vja) < (kjo, ees Kini) € Ri (i € I) 
e (M, k) Ev; =v; & ki =k; (i,j € n) 


e if V1, V» € Fmlc= and i € n, then 


(M, k) € not (M,k) E i 
(M,k) Edi ^o. £ (Mk) EE i & (M,E) E Vo 
(M,k) E Sud, 4h (3k e"M)(vj Zi) 

(ki, = kj & (Mk) E vi). 


If (Mt, k) E «o, then we say that the evaluation k satisfies the formula p in 
the model M. We say that M satisfies p, in symbols M E-cz v, iff for every 
k € *M, (M, ky E- v. The interpretation, or meaning, of a formula ọ in 
a model M is defined as 

mean qp = (k € "M:(M,k) E v). 


Instead of mean mọ sometimes we will use the notation ^^. 


Restricted first-order logic with n variables with equality, " C7, differs from 
the corresponding ordinary logic in the following: in restricted logic the order of 
the variables in atomic formulae r(vo, . . . , vs 1) is fixed. 


If no confusion is likely, we will omit the subscript C7 in Eca, etc. 


1 This definition makes sense, since the quantifiers commute, cf. the semantics below. 
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DEFINITION 2 By an inference system | we mean a finite set of axiom schemata 
and rules of inference. Generally, a rule has the form 


H Ai... F An , 
—FA provided C 
where Ao, ..., An are formula schemata and C is some "reasonable" condition. A 


rule is called Hilbert-style if the condition C is empty, otherwise Gabbay-style. -- 
is a Hilbert-style inference system if all of its rules are Hilbert-style, and Gabbay- 
style otherwise. 

The inference system | is strongly complete with respect to the logic L if for 
every set T U {4} of formulae, 


THe Er Ece 


and strongly sound if 
Tky>T Ec o. 
In both cases, if T = Ø, then we use the adverb weakly instead of strongly. 


REMARK 3 In the above definition of inference rules the requirement that the con- 
dition C must be ‘reasonable’ is rather vague. We made this restriction to exclude 
some useless rules (e.g. C might be the condition ‘Ag is valid’). In this paper, C is 
always the condition that ‘some atomic formula does not occur in the conclusion 
Apo of the rule’, an easily decidable syntactic property. 


Let us formulate our main theorem. 

THEOREM 4 Let £ be one of the following logics: 
1. ordinary first-order logic with n variables with equality, C7, 
2. restricted first-order logic with n variables with equality, "LF. 


Let us assume that there are infinitely many relation symbols in the language of £. 
Then there is a Gabbay-style inference system that is weakly sound and complete 
with respect to L. 


3 MODAL FORMALISM AND CONNECTION WITH ALGEBRAS 


We will prove Theorem 4 applying an algebraic representation theorem. Represen- 
tation theorems for algebras state that certain axiomatically given algebras are iso- 
morphic to set algebras (e.g. algebras whose elements are relations, and the opera- 
tions are natural operations on relations). Algebraic representation theorems corre- 
spond to logical completeness results, cf. [7] and [4]. We have to give an alternative 
definition of our logics, so that the algebraic counterparts of these logics have the 
appropriate similarity type. First we define a new logical connective. 
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DEFINITION 5 Let n be fixed and o. € "n, i.e. a function from n into n, then we 
interpret the connective S,, called substitution, as 


(M, k) H Sop S (M, ko) E g. 


For example, if ø sends 1 to 0 and leaves everything else fixed, 


def 
(M, (ko, ki, k2)) E Seq e (M, (Ko, ko, k2)) = p. 
Thus the intuitive meaning of S, (vo, v1, V2) is (vo, vo, v1 ).. 
We will need the following fact, the proof of which is simple and omitted. For 
more details see [8]. 


COROLLARY 6 I[fwe define ordinary first-order logic with n variables by adding 
the connectives Sg (o € ™n) to the set of connectives of the corresponding re- 
stricted logic, then the new definition is equivalent with the old one in the following 
sense: there is a one-one translation function between the sets of formulae such that 
for every formula and every model, the model satisfies the formula iff it satisfies the 
translation of the formula. 


By the above corollary we can simply omit the variables in any formula. Thus, 
first-order logic can be considered as a multi-modal logic, cf. Venema [13] and [15]. 
The existential quantifiers J; (i € n) are diamond type modalities, and the equa- 
tions E;; (i,j € n) are modal constants. Then the possible worlds are the evalua- 
tions k of the variables, and two worlds k and k’ are related by the ith accessibility 
relation if (Vj # i)k; = k;. Ei; holds at k if k; = kj. Substitutions can be treated 
in a similar way. 

Now we define some algebraic notions concerning logics. We will use the nota- 
tion of Henkin, Monk & Tarski [7] and Sain & Thompson [11]. Thus I, H, S, and 
P denote the operators taking isomorphic copies, homomorphic copies, subalge- 
bras, and isomorphic copies of direct products, respectively. We denote the classes 
of n-dimensional cylindric, representable cylindric, and quasi-polyadic equality al- 
gebras by CA,,, RCA,,, and QPEA,,, respectively. We will use the notions of variety 
and of discriminator variety, cf. Burris & Sankappanavar [5]. Given a variety V, 
we will denote the class of its simple elements by SimV. 

From now on, by a logic we will mean any of the logics in Definition 1. 


DEFINITION 7 The formula algebra of a logic L is defined as 


def 
F = (Fmle., c) eese) . 


The corresponding class of algebras for a logic L is 
Alg£ $8 (meanj4" F : M € Modz}, 


where mean m" F is the (homomorphic) image of the formula algebra F along the 
interpretation mean m of the formulae. 
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We mention, without proof, the following consequence of the above alternative 
definition of our logics. For more detail about the correspondence of logics and al- 
gebras, how to translate algebraic equations to logical formulae and vice versa, etc. 
we refer to [4], [7] and [9]. 


COROLLARY 8 Alg" £z and AlgCz generate the varieties of representable CA, 's, 
and representable QPEA,,’s, respectively. 


DEFINITION 9 Let a be any ordinal and V, be one of CA,, or QPEA,. Let A € 
Va and a € A. We say that a is rectangular iff 


C(r)@* C(ay@ = C(rnaya 


for all finite subsets T and A of a. 
We say that A is rectangularly dense iff 


(V0 Za € A)(30 Z b € A)(b < a & b is rectangular). 


Let us formulate the representation theorem referred to above, which is proved 
in Andréka et al. [2]. 


THEOREM 10 Let a be any ordinal and V, € {CAa, QPEA,}. Let RV, denote 
the class of representable elements of V, and VR, denote the class of rectangu- 
larly dense elements of V œ. Then 


RV, = SPVR,. 


4 PROOF 


Now we are ready to prove Theorem 4. In the following proof, we will define a 
Gabbay-style calculus given by the translation of the axioms for CA, , and a rule 
corresponding to rectangular density. The intuitive meaning of this rule is the fol- 
lowing: a given formula ¢ is satisfiable iff there is another formula ~ such that 


1. wis satisfiable, 
2. p has a special form (viz., it is rectangular), 
3. w — wis satisfiable. 


Proof. [Theorem 4] We give the precise proof for "£7 (i.e. for 4.2), and indicate 
the differences for the other logic. 
First we note that for A € CA, and a € A, 


a is rectangular <> A E 7(a) = 1 


GABBAY-STYLE CALCULI 249 


where 
T(a) = —C(n) — (- Ito ups | € n} + a), 


cf. [2]. Further, by setting Va = CA, in the representation Theorem 10, we get that 
the rectangularly dense elements of CA, are in RCA,,. 

Let Az be a finite set of equations axiomatizing CA,. Let the inference system + 
be defined as follows. Its set of axiom schemata is the translations? of the elements 
of Az into the language of " Cz : for every i, 7, k(n, 


1. enough propositional tautologies, 
. di(p V V) + diy V div, 
. p> dit, 
. didi > die, 


. Bix, 
. Bin © 3;(Eij ^ Ej) if 7 g {i,k}, 


2 
3 
4 
5. Ji3j;o € 3jdiv, 
6 
7 
8. Ej AA(Ej ^p) ifi j, 


its inference rules are Modus Ponens, Universal Generalization and the following 

rule: 

F(p^T(p^-v) 3v 
Fo 

where 7(p ^ ^) is the translation of T(p ^ ^): 


(N-rule) provided p ¢ o 


73(^(^ A 3. (yy (P ^ ^) V (p ^ ^9)), 
iEn 
and p € ọ denotes that p is an atomic proposition not occurring in the formula q. 
SOUNDNESS: We check the validity of the N-rule. Assume that ¢ is not valid, i.e. 
that there are M and k such that 


(M, k) E o. 
Then 2^ £ Ø, so we can choose (ao,...,a5 1) € ^v^. Let p be a relation 
symbol not occurring in o, and let p^ = {(ao,...,@n—1)}. Let the evaluation of 
variables (ag, ..., 4.1) be denoted by a. Then 
(M,a) E p^ v. 


"This translation is defined by substituting formula schemata for algebraic variables, and 
replacing algebraic operations by the corresponding logical connectives. 
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We claim that for every evaluation k, 


(M, k) F T(pA 79). 


This amounts to prove that for an arbitrary evaluation k’, 


(M, k’) E 2 A 3s quy (P^) V (P^). 


1€n 
To prove this, first let k’ = a. Then 
(M, k') E p ^ 9. 


If k’ Z a, then for some j € n, a; Æ k;, whence 


(M, k') E 3 iy (p ^ 2). 


Thus in both cases, 


(M, k’) E 5 A Ans py (P^ 79) V (p^) 


i€n 


whence for any evaluation k, 


(M, k) E 35) A Insip (PA 2) V (p ^ ^9), 


1En 


(M, k) E T(p ^ ^o) 
as desired. Then we have 
(M,a) p ^^v & (M,a) E T(p ^-^) 


whence 
(M,a) E (P^ T(p ^ ^9)) > v. 


So the rule is valid. The other rules and the (instances of the) axiom schemata are 


clearly valid, since Alg” £z = RCA, C CA, 


COMPLETENESS: First we prove that the syntactical Lindenbaum-Tarski algebra 
A is a rectangularly dense element of CA,. Clearly A € CA,. Nowlet O0 Z a € A. 
Then a is the equivalence class of a formula v such that I’ ^v. By the N-rule above, 
V (p^T(p^wv)) —- V whenever p ¢ v. Let us fix such a p, and let b be the 
equivalence class of the formula (p ^ V) ^ T(p Aw). Then b Z 0 and b < a. It 
remains to show that b is rectangular, i.e. 7(b) holds in A. In fact we will show that 


CA, E: T(x - T(x)) = 1. 
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Let B be a simple element of CA,,. Since 7 is closed under the cylindrifications, for 
every x € B, 
T(x) € (0,1). 


Now assume that T(z) = 1 holds in B. Then 
T(x- T(x)) = T(x) = 1. 


If r(x) = 0, then 
T(z-T(x)) = 7(0) 2 1, 


since 0 is rectangular. Then SimCA,, } r(z - T(z)) = 1 whence 
CA, E T(z:T(x)) = 1, 


since CA,, is a discriminator variety. In particular, 7(b) = 1 for the b above. Thus, 
for arbitrary non-zero a, we found a non-zero b < a that is rectangular. This means 
that .A is rectangularly dense, so it is in RCA, by the representation Theorem 10 
above. 

Now assume that 7 y. Then A E y’ = 1 where y’ denotes the equivalence class 
of y. Then, since by the above argument A € RCA, = HSPAlg" Cz, we have 


Alg" C, Fy — 1, 
i.e. there is a model M such that o^ 4 "M, whence 
M E v. 


Thus q is not valid whenever it is not derivable. 

The proof is essentially the same: just use QPEA,, instead of CA,,, and its rep- 
resentable subclass instead of RCA,,. 

So Theorem 4 has been proved. iu 


REMARK 11 The N -rule above is only weakly sound. That is, there are formulae 
p and w such that ù = (p ^ T(p ^ ^o)) > v but v KF qv. Indeed, let p be 
a non-valid formula, and let p and M be such that sp“ C —pM. Then ^p E 
(p NT(p A 7)) > y. On the other hand, ^p F q. 
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PART III 


TRANSLATION-BASED PROOF SYSTEMS 


H. J. OHLBACH, R. SCHMIDT AND U. HUSTADT 


TRANSLATING GRADED MODALITIES INTO 
PREDICATE LOGICS 


1 INTRODUCTION 


From Minsky’s early frame systems, which were defined purely operationally, and 
Brachman’s KL-ONE knowledge representation system [4, 35] to the 
language ALC of Schmidt-SchauB and Smolka's [28] paper there has been a 
continuous trend in designing knowledge representation systems more and more 
according to logical principles with clear syntax and semantics and logical infer- 
ences as basic operations. ACC in particular is a language with the usual logical 
connectives M, Ll, ^ and the additional constructs (all R C) and (some R C). For ex- 
ample, the following is an ALC definition which defines a ‘concept’ 
proud-father as a father all of whose children are successful persons. 


proud-father = father M (all has-child successful-person), 


The fragment of ACC that includes the operations N, U, ~, all, some is just a vari- 
ant of the multi-modal logic K(m) [27]. The concept (all R C) corresponds to [R]C 
where the relational term R (a 'role' in KL-ONE jargon) is the parameter of the 
modal operator, and is interpreted as a binary accessibility relation. ACC is still 
limited in its expressiveness. In pure ACC it is not possible to define concepts like, 
for example, a city as a place with more than, say, 100000 inhabitants. There are 
extensions of ALC, like ACCAM, with additional operators, called ‘number restric- 
tions’. 


(1) city = place r1 (atleast 100001 inhabited-by people) 


is a suitable ACCN definition. (atleast n R C) and (atmost n R C) restrict the 
number of so-called ‘role fillers’, i.e. they restrict the number of elements in the 
range of the relation R to at least n and at most n, respectively. The corresponding 
modal logic of ACCN is the multi-modal version of the system of ‘graded modal- 
ities’, which was introduced by Goble [17] and Fine [13, 14] and which is investi- 
gated in Fattorosi-Barnaba and de Caro [12, 11], and van der Hoek [34, 33]. 
Graded modalities are modal operators indexed with cardinals which fix the num- 
ber of worlds in which a formula is true. The formula O,,q is true in a world iff 
there are more than n accessible worlds in which ¢ is also true. The dual formula 
Oy, given by 294-9, is then true in a world iff there are at most n accessible 
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worlds in which ~g is true. More formally, the semantics is defined in terms of one 
accessibility relation, say R, by 


M,zbEg Os iff |y|R(zy)&M,yEg epn 
Mx Hg Ong iff |(y| R(z,y) & Myke ^v) <n, 


where M denotes a model and x, y denote possible worlds. For any set A, |A| 
denotes the cardinality of A. This semantics is very natural and intuitive, but it 
has one disadvantage. All inference systems based on this semantics, in particu- 
lar, tableaux systems, deal with these ©,,-operators by generating a correspond- 
ing number of terms explicitly. For example, the formula O09 ooopeople triggers 
the generation of 100001 constant symbols as representatives for the individuals 
denoting people. Except for counting these constant symbols and comparing the 
length of lists, known tableaux systems do not provide for arithmetical computa- 
tion. In particular, reasoning with symbolic arithmetic terms is impossible. For ex- 
ample, in tableaux systems the formula O,,41p — Op which is true for all n can 
only be verified for concrete values of n, but in general it cannot be verified for 
arbitrary values of n. 

This is not the case for the Hilbert system axiomatizing the graded modalities. 
It is formulated with arithmetical terms, and in principle, this allows for invoking 
arithmetical computations. However, Hilbert systems have other disadvantages that 
makes them unsuitable to form the basis for automated reasoning. For example they 
do propositional reasoning just with modus ponens and the instantiation rule. Even 
for trivial theorems one gets large proofs and the search space is very unstructured 
and enormously big. 

A direct translation of formulae with graded modalities into predicate logic re- 
quires the axiomatization of finite domains. This is feasible only for small cardi- 
nalities. We may translate sentence (1) as follows: 


Vrcity(r) «€» place(x) ^ 


Jyi...9100000 Yı  y2^V1 É Us ^ ... AY É Y100001 A 
Y2 Æ Y3 ^ ... ^ ya É V100001 ^ 


Y100000 Æ 9100001 ^ 
inhabited-by(x, yı) ^ ... A inhabited-by(x, y100 001) ^ 


people(y1) ^ ... ^ people(yioo 001). 


The translation of ©,,-expressions requires (n + 1)n/2 equations. Even for small 
n this is more than current theorem provers can cope with. One immediate alter- 
native is introducing set variables and a cardinality function. For sentence (1) an 
alternative formulation is: 


Vz city(x) «€»  place(z) A 3Y (|Y |)100000 ^ 
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Vy (y € Y — (inhabited-by(z, y) ^ people(y)))). 


This is not really a feasible alternative, for the axiomatization of the cardinality 
function then requires the above (n -- 1)n/2 equations, and this for every n: 


VY Y)n © Iyi... Yn y1€Y^...^yau1€Y^ 
yi E Y2 NY £ Vs ^ ... ^V É Jai ^ 
ya E£ ya ^ ... ^a É Yn A 


Yn F Yn+1- 


In this paper we present a two step translation of graded modal logics into pred- 
icate logic. In the first step, we transform graded modal logics into another multi- 
modal logic with standard interpretation. In particular, we accommodate modal log- 
ics with graded modalities in a multi-modal logic with two kinds of modalities: 


(i) (n), [n] characterized by a relational structure (over a universe U) of infinitely 
but countably many different relations R,, (n € INo), and 
(ii) ©, O characterized by a designated relation FE. 


We translate formulae of the form ©, into (n)Oy and the intuitive idea underly- 
ing this translation is this: If ọ is true in a set Y of worlds with more than n elements 
then we introduce an accessibility relation Rn that connects the actual world and a 
world wy which we can think of as being a representative for the set Y . This defines 
the (n)-operator. Oy and its associated accessibility relation E expresses that q is 
true in all the worlds of the set Y . E connects the world wy with all the worlds in 
Y and can be thought of as the membership relation. Thus, (n)Oy encodes ‘there is 
a set with more than n elements (encoded by (nY) and y is true for all the elements 
of this set (encoded by OY’. Our first problem now is to find a sound and adequate 
axiomatization of the modalities (n), [n], © and O as to capture the graded modal- 
ities ©, and O. It turns out that this is not entirely possible. The axiomatization 
we present in this paper has some non-standard models which do not reflect our in- 
tuition. But this does no harm, as we will see. We show: A formula ¢ is a theorem 
of a graded modal logic iff the translation of ọ is a theorem in the new logic. This 
translation is only an intermediary step in a translation to predicate logic. 

In the second step, we translate the multi-modal logic into a predicate logic using 
the functional translation of [23, 24, 10, 18, 2, 36]. The reason for using the func- 
tional translation instead of the usual relation translation is this: The multi-modal 
logic of graded modalities can have frame properties that are not first-order defin- 
able in terms of R,, relations. However, the frame properties can be formulated in 
a weak fragment of second-order logic and it is possible to formulate them in an al- 
ternative adapted language as first-order expressions. The alternative language is a 
functional language in which binary relations are encoded as sets AF of functions. 
The set AF of accessibility functions defining the accessibility relation R is given 
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by: 
R(z,y) à 3f € AF y = f(x). 


This sequence of translations of a system of numerical modalities first into an- 
other multi-modal logic and then into a many-sorted predicate logic (using the func- 
tional translation) yields an axiomatization, in particular, an axiomatization of prop- 
erties of finite sets. Instead of counting symbols this system uses arithmetical rea- 
soning. 

This paper is structured as follows. In Section 2 we give a short overview of 
modal logics with graded modalities. In Section 3 we introduce the normal multi- 
modal logic for accommodating graded modal logics. We define a translation from 
logics of graded modalities into the multi-modal logic that we exhibit to be sound 
and complete. In Section 5 we present the functional translation of the multi-modal 
logic into predicate logic. We conclude with Section 6 in which we apply the new 
techniques to the knowledge representation language ACC. 


2 GRADED MODALITIES, THE SYSTEM K 


Normal modal logics like K, T, S4 and S5 have one modal operator, the necessity 
(or box) operator O. The possibility (or diamond) operator © is defined as its dual. 
By definition, 

Oy e Ug. 


In [17] Goble investigates modal logics with more than one modality. His logics 
have a fixed and finite number of modalities. Each modality represents a different 
grade of necessity. For example, the formula 


Nme A Naw 


for positive integers m(n, is read to mean vj is more necessary than q. Kit Fine [13, 
14] generalizes this idea and introduces modal logics with numerical modalities. 
These are now commonly referred to as modal logics with graded modalities. In 
a series of papers Fattorosi-Barnaba, de Caro and Cerrato [12, 7, 11, 6] rediscover 
and analyze various modal logics of graded modalities. 

Recent investigations of graded modal logics are by van der Hoek in [34] and 
[33]. Together with de Rijke he applies graded modalities to linguistics and artifi- 
cial intelligence. In [31] they show that generalized quantifiers can be modelled 
with graded modalities. In [32] they also show that certain numerical quantifier 
operations available in KL-ONE-based knowledge representation languages can be 
modelled with graded modalities. 

In this paper we adopt the definition of the graded modal logic K of van der 
Hoek [34]. K is an extension of the normal modal logic K with graded modal- 
ities. Formally, the vocabulary of K consists of the set of propositional symbols 
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P, P1, P2, - - -, q, q1, q2, - . - the constant L (falsity), the logical symbol — (impli- 
cation) and the modal operator symbols ©,, for n € INo, and the usual punctuation 
symbols. Formulae of K have the following forms: 


Dd. l, py, On. 


As usual we define ^o (negation), T (truth), o V w (disjunction), p ^ v» (conjunc- 
tion) and «v (double-implication) as abbreviations for pol, =L, 
ay — ww and ^9 V ~y, respectively. Furthermore, Cl; 4» abbreviates -©O,,- for 
n > 0, Oloy abbreviates Clo^o, and O!,¢ is the abbreviation for O,_1y A 
Ong with n)0. 


On isreadtomean is true in more than n accessible worlds, 
O,y isreadtomean gis true in at most n accessible worlds, and 
Olny  isreadto mean cis true in exactly n accessible worlds. 


DEFINITION 1 The system K of graded modalities is defined by the following ax- 
ioms 


Al the axioms of propositional logic 

A2 Fg Onti 9 One 

A3 kg Oole > V) 2 (949 — Ony) 

A4 kg Doc( ^ y) 3 (Ohne ^ Ct) 4 Olmo V v)) 


together with the uniform substitution rule, Modus Ponens, and the necessitation 
rule for Oo: 


US ify is a theorem so is every substitution instance of q, 
MP if Fe e and Fg pw then Fg v 
N iff p then" e Ooy. 


Observe that Oo and Op coincide with the standard modal operators © and O. K is 
therefore a subsystem of K. 


THEOREM 2 [34] The following are theorems of K. 
AS Fg Doly > v) + (Bae On) 

A6 Fg On(PAY) > (OnpA On) 

A7 Fg OnpAO!me +L forn#m 

A8 Fe Onay 4 (Oby V Ohe V ... V Olay) 
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A9 Fg 7Wnl(P VY) > 7Ony 
A10 Fe Ontm(Y V V) > (Cap V Om) 
All Fg Ol.p^Omp—lL form>n 


A12 Fg On(PAY) A Om(G ^ ^V) 3 Oum 
.( V denotes exclusive-or.) 


The semantics of K is given by a frame F = (W, R) consisting of a non-empty 
set W, called the set of worlds, and a binary relation R over W, called the acces- 
sibility relation. We define a model (based on a frame F) of K as a triple M = 
(W, R, V). V denotes a valuation function mapping propositional variables to sub- 
sets of W. Truth in a model M for formulae of K at any world z is defined (in terms 
of the satisfiability relation Eg ) as follows: 


M, xg P iff rc€cV(p) 
(2) Mark Ong iff |{y E W|R(z,y) ^ M,y Eg vj)ln 
M,rzbEg Ony iff [yeW|R(z,y) ^AM,y Pg ~p} <n 


and as usual for the other connectives. For any binary relation R, let R(x) denote 
the set of images of z under R. That is, define R(x) = (y | R(x, y)}. Then, satis- 
fiability of O4q and O,,y can be reformulated as follows: 
M, T Ex Ong 
iff 3Y C R(z) with |Y |)n and Vy € Y : M,y Fg € 
(3) 
M,a Fx Qo 
iff VY C R(x) with |Y|)n, 3y € Y : M,y Fg v. 
(The proof is routine.) It is now easy to see that the usual duality for box and dia- 


mond also hold for O, and Oy, i.e. we have Ony € 7On7y. _ 
A formula ọ is a K thoerem, i.e. Fx y, iff y holds in all worlds of all K frames. 


THEOREM 3 The axiomatization of K is sound and complete, i.e. for all formulae 
v, we have 


tee if Fg 
A proof can be found in [12]. E 
In the remainder of this paper we assume the formulae of K to be in negation nor- 


mal form which can be obtained by systematically applying the following equiva- 
lences from left to right. 


ayyy) è ^e^ov 
^é((e^v) e€ yyw 
QV è Vy 
pep o (e2V)^(v ow) 
lo e On 
WMnp € Up 
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3 FROM GRADED MODALITIES TO MULTI-MODAL LOGIC 


In this section we present a new interpretation for K and investigate its logical coun- 
terpart. 

A formula Ony of K is interpreted as an expression in which a subset of the ac- 
cessible worlds with more than n worlds is selected. More concretely, the formula 
Ong is true in a world z iff there is a set of worlds (a subset of W) accessible by R 
from x containing more than n worlds in which ¢ holds. Equivalently, Ong is true 
in x iff there is subset Y of the range of R from x with cardinality strictly greater 
than n such that is true in every world in this subset, see (2). In our alternative in- 
terpretation of K we introduce a new class of worlds Wy, each world representing 
subsets of accessible worlds of W . That is, we represent the set Y by a designated 
world in this new class of worlds. Furthermore, instead of having just one accessi- 
bility relation R, here, we have for each n € IN, a different accessibility relation 
Rn. Their domain is W and their range is restricted to Wy . More precisely, for any 
n, Rn relates worlds in W to those elements in Wy which represent sets (of worlds 
in W) of cardinality greater than n. And finally, there is an additional designated 
accessibility relation, denoted E for ‘element of’, which relates the new kind of 
worlds Wy to the worlds in W again. The relation E represents the element-of re- 
lation (strictly speaking the converse of the ‘element of’ relation) between a subset 
Y of W and its elements. For example, consider the formula O3. According to 
the definition of the previous section O3¢ is true in a world z iff there are at least 4 
worlds to which x is R-related. This definition is depicted in the first picture below. 
The second picture depicts our new alternative view. 


yı yı 


WE Nee 


ya Y4 


The relation R is replaced by the relational composition of the two new relations 
H3 and E. In the process we have introduced a new world which we labelled Y as 
it is meant to represent the set of worlds y1, y», y3 and y4. 

The alternative semantics for K sketched above characterizes a new graded 
modal logic which we describe now. We call the system Kg. It is a normal multi- 
modal logic system with graded modalities. In this logic the operators ©,, are re- 
placed by a combination of two operators. Kg is more expressive than the system 
K. Nevertheless, it has similar properties as K as we will show below. 

Our system Kg differs from an alternative translation into a multi-modal logic 
‘Lcount’, developed by [1]. In their system there are n-place operators (n) with 
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semantics 


M,z E (n)gi,...,9n iff there are distinct y1,..., yn such that 
M,yı F v1 and ... and M, yn F Pn 


Calculi based on this semantics, however, seem not to be much different to the 
calculi based on the original semantics for graded modalities. In a corresponding 
tableaux system one has to introduce witnesses for the worlds again, but this is what 
we want to avoid. 


3.1 The system Kg 


The language of Kg is that of K with the graded modalities ©, and L1, replaced 
by the symbols (n), [n], © and O. Formulae of Kg have the following forms: 
Did. L, py, (n)y, Ov. 


As in Section 2 we define the classical connectives in the usual way. The duals of 
(n) and © are abbreviated as follows: For n € No, [n]y abbreviates 2(n)— and 
Oy abbreviates 2O-. The intended meaning of (n)q is, 


iQ is true in some world accessible by the binary relation Rn. 
The intended meaning of Oq is 
| i is true in some world accessible by the binary relation E. 


We call (n) and [n] the numerical operators and O and O the membership opera- 
tors. The relations Rn and E are defined as sketched above. Namely, W forms the 
domain of the R,, and the co-domain of E and the new class of worlds Wy forms 
the co-domain of the A,, and the domain of E. Dually, the intended meaning of 
[n] is 

y is true in all worlds accessible by Rn. 
And the intended meaning of Oy is 

4 is true in all worlds accessible by E. 


So, the syntax (nọ (respectively [n]y) is the shorthand for (R,,)y (respectively 
[Raly), Ov (respectively Oy) is the shorthand for (E)« (respectively [E]y). 

K-formulae of the form ©, and O,,y can be formulated as K p-expressions of 
the form 


(n)Oy and [n]Og, 


respectively. The logic Kg is more expressive than K. It permits arbitrary combi- 
nations of modal operators, not only alternate combinations of necessity and possi- 
bility operators. For example, [3][4]OO is a well-formed formula of K g, although 
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it may not make much sense in our intended semantics. However, there are combi- 
nations of modal operators which have no equivalent formulation in K, but which 
have interesting applications. Here is an example of such a formula: 


[10](football-team — O football-player). 


It says that, if there is a set with more than 10 elements for which the proposition 
football-team holds then the proposition football-player must be true for all its ele- 
ments. In this way we can distinguish between notions like teams which we inter- 
pret as sets and notions like players which we interpret as elements. 

We now give a Hilbert axiomatization for Kg and investigate its characteristic 
frames. 


DEFINITION 4 The axioms and rules of the system Kg are: 
N1 the axioms of propositional logic and Modus Ponens 


N2 the K-axioms for [n] and D: 


Fe, [n](e 9 v) 2 (nly 2 [n]) Fg, Alyy) (0% > OY) 
N3 the necessitation rules for |n] and Q: 
Ife, 9 theng, [nlp ifte, Y theng, Op 


N4 Fe, [0]Oy- [n]Oy 

N5 Fg, (n)Oyp > (n) Ov 

N6 Fg, [ny — [n + 1]o 

N7 Fg, (nc m)U(e V v) 2 (n)Oe V (m)Uy) 

N8 Fg, ((n)Q(y ^ v) ^ (mOl ^ 2y)) 2 (n +m + 1)0 

Although the box operators can be treated as abbreviations in terms of diamond 


operators, or vice versa, we use both operators and allow for arbitrary conversions 
back and forth with 


[nlp e -m^v 
Op € 39-9. 


N1-N3 are the basic axioms for every normal modal logic. N4 captures that, if 
something holds for every set of worlds with more than zero elements, that is, if it 
holds for every non-empty set of worlds, then it holds also for all sets with more 
than n elements. This means, the composition Rn ; E for n arbitrary is a subrela- 
tion of the composition Ro ; E. N5 ensures that no set with more than n elements 
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is empty. A contrapositive version of N6 is (n + 1)Oy —> (n) Oy. It captures that 
sets with more than n 4- 1 elements are sets with more than n elements. 

N7 corresponds to A10 and is a bit more complicated to explain. As an example 
suppose n = 2 and m = 4. For these values N7 is 


(6)G(v V v) 2 (2)Ge V (4)By) 
which is equivalent to 
((6)O(y V v) ^ 2(2)B9) ^ (4)B. 


In words, if there are more than 6, say 7, worlds in which the formula y V y is true, 
but it is not the case that y holds in more than two worlds (i.e. ^ is true in all but 
possibly two worlds) then in the remaining 5 of 7 worlds v is true. The axiom says 
that every (n + m)-element set can be decomposed into an n-element set and an 
m-element set, but note, the axiom is slightly stronger. 

The intuition underlying N8 is the following: Suppose there is a set Y; with at 
least n + 1 elements where q ^ «v» holds and there is another set Y> with at least 
m + 1 elements where y ^ ~y holds. Since and ^y cannot hold simultaneously 
in one world, Y; and Y? must be disjoint. Thus, y holds in Y? U Y»? which is of 
cardinality at least n + m + 2. Therefore, (n + m + 1)Q is true. 

Now we turn to the semantics of Kg. The K-axioms and necessitation rules al- 
low us to use the standard Kripke semantics. We choose the Kripke semantics for 
the multi-modal logic K(;, where m is the number of modal operators. Note that 
Kz has infinitely but countably many modal operators. A K g-frame is a relational 
structure 

F = (W, {Rn}neno ; E). 

W is a non-empty set of worlds. The R» are binary relations over W (each is as- 
sociated with a modality (n)) and E is a designated binary relation over W (as- 
sociated with the modality ©). The relations satisfy the properties N4Q—-N8 given 
below. A model of Kg based on a frame F is a pair M = (F,V) where V isa 
function mapping propositional variables to subsets of W. Truth and satisfaction 
for the propositional fragment of Kg is defined as for the propositional fragment 
of K. See (2) in the previous section. A modal formula is satisfied (is true or holds) 
in a world x iff depending on its form the following holds: 


Mr Fx, (m) iff there is ay such that R,(z,y) and M, y Fg v 
M,x Eg, [nly iff forally such that R,(z, y), My P v 

M,x Fg, Oy iff thereisa y such that E(z, y) and M, y Fg Y 
M,x Fx, Ov iff forallysuchthat E(z, y), M, y Fg o. 


A formula is a tautology if for any frame F the formula is satisfied in all F-based 
models. 

The following are the characteristic properties of K p-frames that correspond to 
the axioms N4-N8: 
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N4 Vayz ((Ra(z,y) A Ely, z)) > 3u (Ro(z,u) A Vv (E(u, v) 2 v = z))) 
NS’ Vay (Rn(z,y) 2 3z Ely, z)) 
N6' Vay (Ra (o, y) > Rn(z, y)) 


N7’ Vay Rat+m(2, y) > 
Vfg3uv (Rn(x,u)—> E(u, f(u)) ^ 
Rn(x,v) 2 E(v, g(v))) 2 
(FE, (z, u) ^ Rm(z,v) A Ely, f(u)) A f(u) = g(v)) 


N8’ Vayz (R,(z,y) ^ Rm(2,z) ^ Vu (Ely, u) 2 AE(z,u)) 2 
Ju (Rn4m4i(z,v) ^ Vw (E(u, w)  E(y,w) V E(z, w)))) 


for any n, m € No. 

We computed these properties with a tool, called SCAN!, which is an imple- 
mentation of the quantifier elimination algorithm of Gabbay and Ohlbach [15]. To 
this end we used the standard Kripke semantics as translation rules for translating 
the axioms into predicate logic. In this relational translation (this is the standard 
translation ST of van Benthem [29, 30]), the formula variables become univer- 
sally quantified predicate variables. The quantifier elimination algorithm produces 
for these second-order formulae equivalent formulae without predicate quantifiers. 
That means ST(N4) + N4' etc. This procedure guarantees soundness of the seman- 
tics with respect to the axiom system, i.e. 


(4) if Fe, o then Fx, v. 


If all these “frame axioms’ (N4—N8) were first-order then the Sahlqvist Theorem 
[26, 29, 30] would ensure completeness of this frame class relative to the axioms. 
Unfortunately, N7' is again second-order. Therefore we have to prove completeness 
explicitly. We do this indirectly for translated K formulae by using the complete- 
ness of K and the soundness and completeness of the translation into Kẹ which is 
proven below. General completeness for arbitrary formulae is still open, but for the 
purpose of our translation, this is fortunately not necessary. 

The correspondence property N4’ states that all singleton subsets of the set of 
worlds accessible by R;, are uniquely represented by a world accessible by Ro. N5' 
asserts that every world accessible by R, leads via E to another world. We say E is 
weakly serial. (Recall, a relation R is said to be serial (or total) iff Vr dy R(x, y)). 
By N6' the set {Rn }neno of Rn relations forms a linear order with Ro being the 
largest element, since for any m)n, Rm is a subrelation of R,. 


! SCAN is accessible via World Wide Web at 
http://www.mpi-sb.mpg.de/guide/staff/ohlbach/scan/scan.html. 


This is a WWW interface for activating the program remotely. We invite the reader to use 
the tool and verify the above correspondence properties for N4Q—N8. 
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The correspondence property N7' of N7 expresses intuitively that every set y 
with more than n + m elements can be decomposed into a set u with more than n 
elements and a set v with more than m elements, and if y happens to have exactly 
n +m + 1 elements then u and v overlap in at least one element. 

N8’ expresses, as already mentioned, that for disjoint sets the cardinality of their 
union is the sum of the cardinalities of the sets. 

For a better understanding of the frame properties it is helpful to think of the 
variable y in Rna (x, y) and E(y, z) as representing a set Y , Ra (x, y) as representing 
that the cardinality of Y is greater than n, and E(y, z) as representing that z is an 
element of Y. Then N4’—N8’ represent: 


NA" VYz((lY)n^zeY)o(z)cY 
NS” Vy(|Yl)nY z 2) 
N6” VY (|Y|)n 4- 1 |Y |)n) 


N7” VY (|Yl)n - m 
Vfg SUV (|U|)m ^ |V])m) 3 
if f selects from U and g from V then f(U) € Y ^ f(U) = g(V)) 


N8” VYZ((IlYn^|Z)m^Y NZ 2 2)3 
JV (Vn - mc 1^ V CY UZ)). 


We can show that the standard class of frames associated with Kp have the 
expected structure, namely that all worlds accessible by R„ have more than n E- 
successors. However, non-standard Kg-frames exist which do not have this 
intended structure. The problem is, we cannot enforce in a Hilbert system that R1- 
accessible worlds have more than one E-successor. This may be captured by an 
axiom like 


[1](3p (Op ^ €p)), 


or a rule similar to Gabbay’s irreflexivity rule (but this gives no new theory) [16]. 
See also [25]. The modal language of K and Kg is not expressive enough to char- 
acterize this class of frames. On the other hand, we can show using an inductive 
argument that whenever an FE -successor has more than one E-successor, then for 
any positive integer n every R,,-successor has more than n E-successors. This is 
to say, the induction step goes through, but unfortunately the base case of the in- 
duction cannot be guaranteed. Because the translation of the logic K into the logic 
Kg is sound and complete (we show this below), we know whenever a translated 
K-formula has a model then it has a model with the expected structure. 

We did not investigate the non-standard models further. It may turn out that they 
are p-morphic images of standard models, in which case they are completely irrel- 
evant because normal modal logics cannot distinguish p-morphic images. 
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3.2 From K to Kg 


Next we define a translation function mapping formulae of K into formulae of Kg. 
We show that the translation is sound and complete. 


DEFINITION 5 The translation function Il maps K-formulae into Kg-formulae 
according to the following constraints: 


Ip) = p 

I(^p) = (y) 
Ip Q y) = Ny) @N(y) 
MOn) = (n)UIl(o) 
(Ong) = [njon(y), 


where p denotes any propositional variable and @ denotes any binary logical con- 
nective ^, V, œ or €. 


THEOREM 6 (Soundness of II) The translation II from K into Kg is sound. That 
is, for any formula ọ of K 


if Fg v then Fg, II(o). 


Proof. Suppose ¢ is a theorem in K. We proceed by induction on the length of the 
proof of y and show that the proof sequence of in K determines a proof sequence 
of II(q) in Kg. We are done if we show that the II-translations of the axioms and 
the rules of K are K g-theorems. 

II leaves the propositional axioms and Modus Ponens unchanged. The transla- 
tion of the necessitation rule N is: 


Fg, 9 implies Fg. [0]Ov. 


If y holds then, by the necessitation rule for O and [0], [0]Ci» holds. Apply modus 
ponens using the contrapositive instance with n = 0 of N5 and get [0] Ov. 

The translation of A2 is a contrapositive version of N6. It remains to prove the 
translations of A3 and A4 can be derived from the axioms of Kg using the rules of 
Kz. 

For A3 we prove 


II(A3) = [0]O(e > v) > ((n)Oy > (n) Oy) 


is a theorem in Kg. Suppose [0]O(y — v) and (n) Dy hold. Suppose further that 
a(n) Oy, i.e. [n] O^v, holds. From [0]O (y — v) we infer by N4 that [n]O(y > v) 
holds. By the K-axiom for O, it follows that [n](Du — Ow). This is equivalent 


?Formally, instead of y one has to consider II(q). But for the proofs this makes no 
difference. 
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to [n|( 2D > ^O), i.e. [n] ( O^v — O-~). Using the K-axiom for [n] we infer 
[n] O ^v — [n] O ^p. From ^(n)AOv, which is equivalent to [n] OÓ—v, using Modus 
Ponens we get [n] O ^o, or equivalently ^(n)Oq. This contradicts (n)Oy. Thus 
II(A3) is derivable in Kg. 

For A4: Let 


$ = [0]O^(v ^ v) ^ (n — 1)0p ^ 2(n)Ue ^ (m — 1)Uy ^ ^(m)Ov. 
Then II(A4) is equivalent to 
o> ((n 4- m — 19U(p V y) ^ «(n * m)O(y V y)). 


We prove this in two steps. First, we prove ¢ — (n + m — 1)O(y V y). Suppose 
¢ holds. It suffices to show 


(5 (n—1)0(¢ A>). 


From (n—1)B(p^^vy), or equivalently (n—1)O((y V v) A—vy), and 
(m — 1)Oy, or equivalently (m — 1)Q(( V Y) ^ v), using axiom N8 we deduce 
(n -- m — 1)Q(e V y). 

For proving that (3.2) follows from @ we proceed by contradiction. Suppose that 
a(n — 1)O(y ^ ^y), i.e. [n — 1] o (^v V v) holds. From [0] O—( ^ Y) using N4 
we get [n — 1]0-(y ^ v) Since, in general, in any normal modal logic O (and, in 
particular, [n — 1]) distributes over conjunction, we obtain 


(60 [n —- 1](C(^e V V) AG(^e V —))). 


The K-axiom for O is equivalent to (Dy ^ Ow) O(qw ^ y). Thus (3.2) is 
equivalent to [n — 1](O((79 V v) ^ (^e V ^v))). This in turn is equivalent to 
[n — 1| O2. Thus ^(n — 1)O9 which contradicts (n — 1)Oy. 

Next, we prove $ — ^(n + m)O(y V v). Suppose ¢ holds. Then, in particular, 
^(n)Ug and 4(m)Ov hold, and ^(n + m)O( V v) is derivable by (the contra- 
position of) N7. Therefore, II(A4) is a theorem in Kg. a 


For proving completeness a semantic proof suffices. We prove (in the next the 
theorem) that for a translated formula I (Y) which is true in all K e-frames, is true 
in all K-frames. If II(») is provable in Kg then the soundness of the K semantics 
guarantees that TI(ẹ) is true in all Kg-frames, then ¢ is true in all K-frames and 
then it is provable in K (by the completeness of K). 


THEOREM 7 For any formula q of K, 
if Fx, H(p) then Fx v. 


Proof. Our strategy is the following. Suppose E-z.. II(q). (i) For an arbitrary K- 
Kg 


frame F we construct a Kg-frame F’. II(q) is valid in this particular frame F”. 
Then we show, (ii) ọ is valid in F. 
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(i): Take any K-frame F = (W, R). We construct a K g-frame F' as an exten- 
sion of the frame F as follows. For any world x € W let R(x) be the R-image of z. 
For any finite subset Y of R(x) with |Y| = n + 1 for n a non-negative integer, we 
add Y as a new world to F. We call Y a ‘set-world’. Note, every Y is non-empty. 
We define every relation Rm for m < n to contain the pair (x, Y ), and, we define 
the relation E to contain all pairs (Y, z) for z € Y . Furthermore, we assume the re- 
lations £t, and E are the smallest relations satisfying these conditions. Now, define 
F' to be the relational structure 


(W', {Rn}neno, E) 


with W’ being the set of worlds of F’ that includes the set of worlds W of F and 
all set- worlds Y . Note, the set- worlds have no Rn-successors and the worlds in W 
have no E-successors. We show that F' is a frame for Kg by showing that F' sat- 
isfies the properties N4'-N8'. 

N4’: If RA, (x, y) ^ E(y, z) holds then y must be a set-world with |y|)n and z € 
y. For u = {z} we obtain Ro(x, u) A Vv (E(u,v) — v = z). 

N5': If R(x, y) then y is a non-empty set-world, i.e. 3z E(y, z) is true. 

N6': If Rn4i (a, y) then y is a set-world with |yl)n + 1)n, i.e. Rn (x, y) holds as 
well. | 

Recall N7': 


Vey Rnim(2,y) 9 Vfg3uv(R, (x, u) > E(u, f (u)) ^ 
Ra (z,v) > E(v,g(v))) 2 
(Rn(z,u) ^ E (z, v) ^ Ely, f(u)) ^ f(u) = g(v)) 


Rn+m(2, y) means that y is a set-world with |y|)n + m. We distinguish two cases. 

Case 1: |y| = n-+m-+1. Let f and g be any functions mapping worlds to worlds. 
If there is at least one E, -accessible set-world u with |u|)n and f does not map u to 
one of its elements (~E (u, f (u))) or there is at least one Rm-accessible set-world 
v with |vl)n and g does not map v to one of its elements (~E (v, g(v))) then we can 
choose this u or v, respectively. Then the implication 


(Fi (x, u) + E(u, f (u)) ^ Ra (z, v) + E(v, g(v))) > 
(Rs (z, u) ^ Rm(x,v) A Ely, f(u)) ^ f(u) = g(v)) 


is true because the premiss is false. 

Now assume, f chooses for every set-world u with |ul)n some element f (u) € 
u and g chooses for every set-world v with |v|) m some element g(v) € v. The 
key observation for the proof is that for every set with n + m + 1 elements every 
decomposition into a set u with n + 1 elements and a set v with m + 1 elements 
overlaps in at least one element. Thus, the situation is as depicted in the following 
figure. 


(7) 
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u 21 
Pu >< 
E 
T Rn+m y Zn41 
E | 
Rm E | 
v Zn+m+1 


For finding the right u and v we follow this procedure: we start by choosing a subset 
ui C y with n + 1 elements. Suppose f (ui) = zi. If there is a subset v C y with 
lv|)m and g(v) = zı we are done. Suppose for no such subset we have g(v) = 21. 
11 is marked as ‘not an image of g'. Now we choose another n + 1-element subset 
uz of y which does not contain x1. Suppose f (u2) = x2. Again, if for some subset 
v with |v|)m we find g(v) = x2 we are done. If not, we mark x2 as ‘not an image of 
g’. We continue until we have found a suitable u and v, or until exactly n 4- 1 worlds 
remain which are not marked 'not an image of g'. In the latter case we choose this 
set for u. Suppose f(u) = x. Take v = y \uU (x). |v| = m + 1 and g(v) Z z for 
all z € y V u. Since g must select some element in v, g(v) = z is the only choice. 
Thus, suitable u and v exist that satisfy (7). 

Case 2: |y|)n +m + 1. Take any subset y' C y with |y'| = n +m + 1. By Case 
1, we can find for any f and g subsets u C y' and v C y' with the property (7). But 
these are also subsets of y and therefore the property holds as well. 

N8’: This property expresses that the union of two disjoint sets of cardinality )n 
and )m is a set with cardinality )n + m + 1, and this is true in F’. 

We have proved F' is a frame for Kz. 

(ii): Let M = (F, V) be any model based on F with V an arbitrary valuation. 
Define M’ to be the model (F', V). (Observe that V (p) does not, and need not 
contain set- worlds.) (ii) follows from 


Mz x, (Y) iff M, Ego (8) 


where x is any world in W. We prove (8) by induction on the structure of y. The 
base case in which y is any propositional variable is trivial. The inductive step for 
the propositional connectives goes through easily. We consider the case y is of the 
form O4v. (The case for y of the form D, y is dual.) The inductive hypothesis is: 


M',x x. II(y) iff ML, x Ex V. 


Suppose M',z Fx, (On), ie. (n)Oy is true at x in M'. Then, Ry(z, Y) 
in F’ for some set Y C R(x) with n + 1 elements and for all z € Y we have 
M',z Fx, V and by the inductive hypothesis M,z Fg v». There are at least n 
such z, therefore, M, x E Ov. 
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Conversely, suppose M, x E Ont). This means the world x has more than n 
successors by R in all of which «y is true. Consequently, a set Y with cardinality 
n + 1 exists that contains R-successors y of x and in all y, v is true. This implies, 
in F’, x and Y are connected by R,, and Y is connected to all its elements by E. 
Thus, (n)Ow is true in x. 

This completes the proof. a 


As consequences we get the following two theorems. 


THEOREM 8 (Completeness of II) The translation Il from K into Kg is complete. 
That is, for any formula ọ of K, 


if EZ. II(q) then Fg (p. 


Proof. Suppose II(i) is a theorem in Kg, i.e. Fg, U(y). Then, since Kz is sound 


(4), Fg, H(p). By the previous theorem Fx o. K is sound and complete (Theo- 
rem 3). Therefore, it follows that Fg ¢. E 


Now, we can show the completeness of the semantics of Kg with respect to its 
axiomatization for translated formulae. 


THEOREM 9 (Relative completeness of Kg) For any K formula p 
if Fx, H(p) then Fg, Il(o). 


Proof. If TI (p) holds in all K z-frames then ¢ holds in all K-frames (by Theorem 7), 
then q is provable in K (by the completeness of K, Theorem 3), and then Ml) i is 
provable in Kg (by the soundness of the translation, Theorem 6). a 


4 FROM MULTI-MODAL LOGIC TO PREDICATE LOGIC 


We aim at making available first-order theorem proving methods for reasoning with 
graded modal expressions. In the previous section we have embedded the logic K 
in the multi-modal logic Kr. Unfortunately, one of the axioms, namely N7, is not 
first-order definable in the standard Kripke semantics. Its relational translation N7' 
is a second-order formula. So, instead of using the standard relational translation 
we use the functional translation as proposed in Ohlbach and Schmidt [22] for non- 
first-order axioms like McKinsey's axiom. 

The functional translation method was proposed by various authors, for example 
Ohlbach [23, 24], [10], [18], [2] and [36]. It exploits the fact that every binary rela- 
tion can be decomposed into a set AF r of functions, called accessibility functions. 
Any (non-empty) relation R is defined by: 


R(z,y) & 3y € AFR y = q(x). 
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In the functional translation we quantify over the accessibility functions instead of 
worlds. For modalities determined by serial (i.e. total) accessibility relations, that 
is, for modalities satisfying the D-axiom the functional translation rules for modal 


formulae are: 

7; ([R]V, 2) — Vr:AFn n5 (9, 4(7,2)) 

7; ((Ry, x) = 3Y:AFn Tilh, 1, 2)). 
The target logic is a many-sorted predicate logic, in which AFg is the sort for acces- 
sibility functions defining R and the symbol | is a function symbol for the ‘apply’ 
function (this means, the application of a function f to x, i.e. f(x), is encoded by 
{(f, x)). For modalities determined by accessibility relations that are not serial the 
set of accessibility functions AF g contains partial functions. Accordingly, the func- 
tional translation 7 ; of modal formulae must compensate for partiality by an extra 
condition involving a predicate deg. For non-serial modalities mp is defined by: 


^den(z)— Vy:AFg n5 (v,l1(y, rz) and -—deng(z) ^SvAFm n; (V, L(y, 2)). 


The term —^deg(z) is meant to capture that x is not a dead-end in the relation R. 

We now give the formal definition of the functional translation II; for multi- 
modal logics of which K(,,,) is the weakest. IT; is a function mapping formulae of 
K(m) to formulae of a many-sorted predicate logic PLm (with predicate variables) 
with a signature specified by: 


(i) sort symbols .L (for the bottom sort), W (for the world sort) and W +. 
(ii) sort declarations W C Wt and L C W+. 
(iii) sort symbols AFg for every modality (R) 
(iv) a binary function symbol | declared by |: AFg x W+ — W+ for all AFg. 
(v) predicate symbols deg for every modality (R) (de is short for dead-end, or in 
our application, de, (x) means the set-world x does not represent more than 
n elements) and 
(vi) for each propositional variable p there is a unary predicate symbol p (we pur- 
posely use the same symbols). 


This signature defines a many-sorted predicate logic we refer to as PL. 


DEFINITION 10 (The functional translation) Let ns be a function that takes two 
arguments: a modal formula and a *world term' which are mapped to a formula in 
PLy. 7; is defined inductively by 

T(p,w) | -—p(w) forpa propositional variable 

nf([R]V,z) = —den(z) + Vy:AFn n £V, 1(7,2)) 

ng ((R)v, T) = ade p(x) ^ Ay:AF R Tf (v, AQ, z)) 
and for the propositional connectives v ; is a homomorphism. 


The functional translation for a multi-modal formula p with propositional var- 
lables p1,. .. , Pn is defined by 


Hily) = Vpi,..., paVw:W Tilp, w). 
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For a Hilbert rule of the form ‘from pı and ... and pn infer p’ 


H(i) A... Al ¢(Gn) + Hy (y) 


is the functional translation. II; maps a set 9 of Hilbert axioms and rules to the 
conjunction of the functional translation of the members: 


II;(9) = A Irl). 
pco 


IIy is called the functional translation function. 


For serial modalities the ^deg(x) ... part of the definition in 7; can be omitted. 

As with the relational translation, the functional translation of the Hilbert axioms 
yields second-order formulae with universally quantified predicate variables. This 
translation is sound, i.e. whatever can be proved from the axioms, holds in the mod- 
els of these second-order formulae. If these second-order formulae are equivalent 
to a first-order formula then the Sahlqvist theorem together with the completeness 
of the transition from the relational to the functional representation (which is easy) 
guarantees completeness. If the second-order formulae are not equivalent to first- 
order formulae then completeness is still an open problem. Unfortunately this is the 
case for Kg. 

Ohlbach and Schmidt [22] prove the following relativized soundness and com- 
pleteness result for the functional translation II. 


THEOREM 11 (Relative soundness and completeness of the functional trans- 
lation) Let 9 be additional Hilbert axioms in a propositional modal logic K(m) 
and q any modal formula. If the relational second-order translation of is com- 
plete then 


q isa 9-theorem iff I (®)—-I¢(y) isa predicate logic theorem in PLM. 


The result of the translation by II; is in general not a first-order expression. The 
translation is useful only if the axioms in II¢(®) can be described by a set ®’ of 
first-order formulae. If II (®) is equivalent to such a set ©’ the implication 


II; (9) > I f(y) 
can be proved by refuting the formula 
$ Agy). 


II¢(y) is a monadic second-order formula only with second-order universal 
quantifiers. In the negation normal form of its negation ~II p (p) only existentially 
quantified second-order predicate variables occur and these are treated as ordinary 
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first-order predicates. Therefore, ®’ — IT; (p) can be proved with the standard first- 
order procedures. 

Not every axiomatization 9 has an equivalent first-order formulation. The the- 
orem above can be strengthened for certain axioms without first-order relational 
characterizations when we use the following quantifier exchange rule. 


DEFINITION 12 (Quantifier exchange rule) Let be any modal formula in 
K(m). Define an operation Y on PLy which transforms the functional translation 
I; (yp) into its prenex normal form according to the rule 


(9  Sy:AFR V&:AFR b. ~ VAFR I:AFp vj. 


The operation Y moves existential functional quantifiers inwards thus weakening 
the original formula. Y (II ¢()) implies IT, (p), but not conversely. The quantifier 
exchange rule exploits that one relational frame in general corresponds to many 
‘functional frames’, and there is always one which is rich enough to allow for mov- 
ing existential quantifiers over universal quantifiers. This is investigated in Ohlbach 
and Schmidt [22] where a stronger theorem than Theorem 11 is proved, namely: 


THEOREM 13 (Relative soundness and completeness of the functional trans- 
lation with the quantifier exchange rule) Let © be additional Hilbert axioms in a 
propositional modal logic K(,,) and p any modal formula. If the relational second- 
order translation of 9 is complete then 


v isa 9-theorem iff Y(II;j(*)) > Y(II;(q)) is a theorem in PLy, 


provided in 'Y (I1, (q»)) all existential functional quantifiers are moved inward as 
far as possible. 


This theorem says that the original formula y can be proved to be a theorem in the 
system ® by proving its weakened translation Y (II ¢(y)) using the weakened forms 
Y (II; (9)) ofthe translations of the axioms in ®. In T (IIy(q)) the existential func- 
tional quantifiers are pushed inward as far as possible. Negating Y (II; (p)) we si- 
multaneously replace universal quantifiers by existential quantifiers and existential 
quantifiers by universal quantifiers. The quantifier prefix of the prenex normal form 
of ^Y (II ¢(y)) consists of a sequence of existentially quantified predicate variables 
pi (ended with an existentially quantified world variable) followed by a sequence of 
existentially quantified functional variables, followed by a sequence of universally 
quantified functional variables. In the Skolemized clause form of ^Y (IIr(q)) no 
Skolem functions occur, only Skolem constants. This simplifies the translation con- 
siderably. More importantly, Y allows us to move existential quantifiers inward as 
far as we like. This weakens an axiom like the McKinsey axiom just enough so that 
we get a first-order translation for the axiom. This operation works for axiom N7 
as well, as we will see in the next section. 

The functional translation generates nested |-terms as arguments to predicates. 
We can avoid these by using the world path notation of Ohlbach [23]. To this end we 
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add a new sort symbol AF* to PLy and we let o be a new binary function symbol. 
Furthermore, we include the following axioms and sort declarations: 


AFg CAF 
o: AF* x AF' ^ AF" 
Vz:W Vy, 0:AF* (yoô, x) = 1(6, L(7, x)) 


o 1S associative. 


o denotes composition operation of accessibility functions and AF* denotes the set 
of all possible compositions of all accessibility functions in the union of AFg. In- 
stead of nested |-terms, like |(6, |(7, z)), we use a more economic notation and 
write [((*y o ô), x) or [([yó], x) (omitting o), instead. The latter uses the world path 
syntax which we prefer from here on. 

Notice that the conditions in the two theorems requiring that the relational trans- 
lation of the axioms into second-order logic is complete means that all formulae 
which are valid in the frames characterized by these second-order formulae are 
provable from the axioms. For Kg we showed this for the translated K formulae 
only. Since this is sufficient for our purpose, we can assume completeness of these 
transformations. 


5 FROM Kz TO PREDICATE LOGIC 


In this section we apply the functional translation method explained in the previous 
section to the modal logic Kg which we introduced in Section 3. 

Recall, Kg is a multi-modal logic with infinitely but countably many numer- 
ical modal operators ((n) and [n]) and two special membership operators © and 
Cl. In the relational semantics the numerical operators are interpreted by the set 
Ut.) ew of binary relations and the membership operators by the special relation 
E. The functional translation for serial modalities is considerably simpler than for 
non-serial modalities. The accessibility relations R,, (n € INo) and E are not se- 
rial. Axiom N5, (n)Oy > (n)y, specifies a weak form of seriality for E. Every 
world accessible by some £, (i.e. every set-world) has a successor by E. 

We do not need the full expressiveness of the language of Kg. A subset of for- 
mulae with characteristic patterns of modal operators (n), [n], © and O will do. 
For example, in the axiomatization defining Kg the operators © and O do not oc- 
cur in the scope of © and O operators, they always occur in the scope of (n) and 
[n] operators. This is intentional. Only these patterns make sense in our applica- 
tion of Kg. We think of the numeric modalities picking only sets and the © and O 
operators picking only elements of these sets. For this we need a special class of 
formulae in which the E-successors of worlds accessible by E are irrelevant, only 
E-successors of worlds accessible by the relations R„ count. We are therefore per- 
mitted to assume E is serial (which we prove in Theorem 14). 
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The language of Kg that we will use is restricted to the set of admissible formu- 
lae. We say a formula y of Kg is admissible iff all © and O operators appear in the 
scope of a (n) or [n] operator (for some n). Examples of admissible formulae are: 


(n)Op, [n]Op, (n)(pA%q) and [n]([üup— 94). 


The formulae 
Op, and (n)O0q 


on the other hand, are not admissible. We note that the translations of K (presented 
in Section 3.2) are admissible formulae, since the corresponding modalities for 
modal operators of K are (n)O and [n]©. The axioms N4-N8 are also admissible, 
whenever o, y and $; are admissible. 

For admissible formulae we may assume the relation E is serial. 


THEOREM 14 Let q.be an admissible formula of Kg. If « is valid in a model then 
€ is valid in a model in which the relation E (associated with the modalities © and 
O) is serial. 


Proof. Let y be valid in a model M = (F, V) based on the frame 
F= (W, Us] aen; E). 


Define F* to be the structure (W, {Rn ) nemo , E*) obtained from F by replacing E 
with E*. E* includes E and all pairs (x, x) of x € W for which no y € W exists 
such that E(x, y). Then, E* is serial. 

We show that y is valid in M* = (F*, V). The only critical case is where yy has 
a formula equivalent to y = O¢ ^ C6 as subformula. For v is true at any world 
x in M iff x has no E-successor in M. In M*, however, w is false at x whenever 
p is true at x in M(otherwise there is an inconsistency). 7 occurs in the scope of 
either (n) or [n] for some n. First, we consider the case that y occurs in the scope 
of (n). Let ' be the (n) subformula of y with v» in its scope. We may assume yy’ 
is of the form (n)((V V a) ^ B). Now, suppose y’ is true in a world z of M. Then 
there is ay € W such that R,(z, y). Since E is weakly serial there is az c W 
such that E(y, z), which implies v is false in y of M. Hence, y’ is true in x of M 
iff it is also true in x of M*. Next, we consider the case that ~ occurs in the scope 
of [n]. Assume y’ is of the form [n]((» V œ) ^ 8) and suppose yy’ is true in z € W 
of M. Then either there are or there are no y's in W such that R,(z, y). If there 
are y's then we argue as above. If there are no y’s then y’ is trivially true in x of 
M*. We conclude, x is indeed valid in M*. i 


This theorem licenses the translation of the © and O operators without the dead- 
end predicate deg. For admissible formulae in Kg the translation function 7; is 
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modified as follows: 


7 ¢([n]q, x) = nde, (x) > Vy:AF, 7 £ (9, L(y, 2)) 
m ((n)v, x) = rden(x) ^ 3r'AFs v5 (5, My, )) 
T, (Dv, x) = Vy:AFE TF (V, L(y, 2)) 
m(Op, x) = J3yvAFg 13 (¥,1(7,2)). 


We are now set to compute the functional translation of the K-axioms N4-N8. 
This is a mechanical and tedious task, which we left to an implementation of the 
general translation procedure and our tool for eliminating second-order quantifiers. 

In our listing of the result we use the following notation. y:n is the abbreviation 
for y:AF,, and y:E for y:AF g£. Variables without sort declarations are assumed to 
be of sort W. In the respective clause forms variables and Skolem functions are 
indexed by their sort. The value in the subscript of a Skolem function, say f”, 
associates the function with the m-th modality (m). AF, is the sort of the terms 
formed with f7 . The superscript is part of the name of the Skolem function. It is 
only used to distinguish the different Skolem functions for the different instances 
of the clauses. (In an actual implementation, these numbers are the objects of sym- 
bolic arithemtical manipulations.) yn indicates "y is a variable of sort AF,,. 

In the following we list for each axiom N4—NS, (i) the functional translation, (ii) 
the first-order equivalent formulation and (iii) its clause form. 


N4: Fe, [0]o9 2 [n]O¢. 
The functional translation II; (N4) is 
VoVz [(^deo(x) > Vy':0 36 :E p(L([7'6'], z))) > 
(^des (x) > Vy:n Vó:E e(L([y6], x)))]. 
This has a first-order equivalent formulation, namely 
Vr deg(z) — de,,(x) ^ 
(10) Vz [^de,(x) — (Vy:n Vó:E 35:0 WOE 
Hird], z) = Liy], z))]. 
The clause form is: 
-^deo (x) V den(x). 
des (x) V Aag], v) = MFO (2,5, 97)9], 2). 
N5: This axiom becomes a tautology because we assume E is a serial relation. 
N6: Fe, [ny — [n + 1]o. 
II (N6) is given by 
VoVz [(-den (x) 4 Vó:n p(L(6, x))) > 
(“den+1ı (x) 2 Vy: p(L(y, x)))]; 
the first-order equivalent by: | 
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(11) Va den (x) — dény1(z) ^ 
Vz [den (x) + (Vy:n+1 3ó:n L(y, x) = 1(6, x))], 


and the clause form by: 


~den (x) V des 43 (2). 
des41 (z) V L(Yn41, £) = Lon(z, Yn+1), 2). 


N7: Fg, (n+ m)O(p V y) > ((n) Oy V (m) Op) 
is translated to II ¢(N7): 


Vow x [(-dentm(x) ^ 3y:n&m V9:E (p(L([76], z)) V YL 74], z)))) 
— [(-de,(z) ^ 3y:n Va:E e(1(Hya], z))) V 


(^des (z) ^ 3y:m VB:E v(V(v8], z)))]]. 
This is equivalent to the formula 


Va [dent (x) V [^des (x) A ^des (x) ^ 
Vy:ntm Va, B:E 30:E ay:n 3y:m 
(L([Yn4+md], £) = Hiinal), £) ^ 
L([¥n+-m4], 2) = M[580)]; 2)))]l. 


(Note, here we have used the sorts as indices to distinguish three different 
variables: Yn, Ym and Yyn4m.) This formula is still second-order (note the 
a(n) and B (4,4) terms). We get a first-order equivalent formula if we apply 
the quantifier exchange rule Y to II; (N7). T (II;(N7)) is 


VeyVz [(^den.m(z) ^ 
3y:nm V6:E (e ([6], z)) V v (.(19], z)))) 2 
[(^des(z) ^ 3y:n Va:E e(A([yo], z))) V 
(nde (x) ^ VB:E 3y:m v(([8], z)))]] 


which is equivalent to the first-order formula 


Va [des (x) V [^des (x) ^ ^dem (xz) ^ 
Vy:nm No: E 3ó:E 3y:m VB:E 3»y:n 
QI m6]; x)= VW], T) V M29], T) = Liynp], z))]]. 


The clause form is: 


des. (x) V ^de, (x). 
des. (x) V ^dejm (x). 
den 4m (x) V Ass hz" (x, Yn+m, a g)], T) 
= L([h27" (£, Yn+m, OE, BE) ag], x). 
déntm(Z)V LU [Yntmh1E"(Z, ym; o E)]; x) 
= 4h37 (z, Yn+m) ar) Bel, z). 
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N8: Fg, ((n)B(v ^ y) ^ (mOl A 2v)) 2 (n +m + 1)0¢ 
is translated to IT; (N8): 


Vu Vz [[(^des (x) ^ 3y:n Vo:E (e((yo], x)) ^ v(4([yo], z)))) ^ 
(^dem (x) ^ 3y:m V8:E (e(1(8], z)) ^ ^vQG(GIY8], 2))))] 2 
(^den-cma1(z) ^ Jy:n+m+1 V6: E p(L([79], x)))] 


This is equivalent to 


Vz [de (x) V des (x) V 
Vy:n V*:m 3y:n* m1 Vó:E | 
(den m41(x) V da, B:E Hina], T) = Aw. 8], z)) ^ 
[3o, BE L([yna], x) = Hmp], x) V 
Ja:E L([yna], x) = Ms m16], 2) V 
38:E Uimh], x) = L([¥n+m414], 2)]]. 


. The clause form is 


de„ (x) V des (x) V ^de, 4 m41(x) V 


Linkt E” (T, Yn, m) 2) = Mv kK28" (£, Yn, Ym)], 2). 
de„(x) V des (x) V 

VIS k3 E” (T, Yn, m) 2) = MY kAE" (2, Yn, Ym)], £) V 

As K5z" (z, y, óg)],z) = AER, La (T, ns ^Im )Ó E], z)V 

VIys K6777 (T, Ym) óg)], T) = Ulker Emt (x, Yn, Ym)ôE], x). 


The K g-axioms and their translations are schemas. They represent the conjunc- 
tion of all instances with the n and m taking on concrete non-negative integer val- 
ues. This can be exploited in certain generalizations. For example, the subformula 
Vz den (x) — den+1(x) of (5) can be generalized to 


Va den (x) — dem(x) for all m)n. 


This formula subsumes the subformula Vz deg (x) — de, (x) of the translation (5) 
of N4. The remaining part of (5) can also be generalized to: 


Va [rdem(x) + (Wy:m 3ó:n L(y, x) = L(6, x))] for all m)n. 
The clause form is 
(12) dem(z) V Lm, x) = L(gn™ (£, ym), x) for all m)n. 
Recall the relation translation of N6. We noted N6' generalizes to 
Rm C Rn for all m >n. 


This ordering on the accessibility relations {Rn }neng induces a linear ordering on 
the set (AF, }neny of sets of accessibility functions. We capture this ordering by 
the subsort declaration 
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(13) AFm C AF, for all m 2 n. 


In a resolution calculus this declaration has the same effect as clause (5). We there- 
fore replace (5) in PLm by the subsort declaration (5). 

The final translation for Kg in PLy is still to come. We add yet more syntactic 
sugar and hope this makes the translations more easily readable. Every translated 
Kz-formula ¢ in negated form, which we aim to refute, contains only terms which 
have the form |([s1 . . . Sn], xo) (in world-path notation). zo is the Skolem constant 
originating from the Vx quantifier in II (7). Such terms can be replaced by just 
[s1 ..- $n] or just the empty list [] for formulae not containing modal operators. 

In the clause form of the translations of the K g-axioms the equations contain 
terms of the form | ([t; . . . tm], 2) with x a universally quantified variable. We may 
instantiate 2x with, say ([s1...5n],20), and obtain |([t,...tn], 
{([s1 - - - Sn], zo)) which is the same as |((s1 ... Snti . . . tm], £o)). We get the same 
result if we introduce a new variable w, of sort AF" and replace |([t; ... 
tm], £) with [w,t, ...tm]. We further require that w, can be unified with arbitrary 
strings [s ... Sn]. 

In this notation the axiomatization of Kg reduces to the following set of PLM 
formulae which defines our predicate logic theory for K g. 


Pl = den (wa), [Wenz] = [wx fg (Wa, Fn, z)y] 
P2 AF, CAF, for all m)n 
P3  -de4,(w,), des(w.) for all m)n 


P4 des m (w«), 
[w. Enymhl”™ (wy » Zn4m y)] = [ws h2»" (ws Tn+m, Y, z)y] 


P5 de, m (w«), [wezs m h1" ^ (We, Tntm, y)] = [w, h35" (Ws, In+m, y)2] 


P6 demax(n,m) (wx), ^deés mi (Ws), 
[wx kK1"" (w, , En, Ym)| = [WeYmk2"™ (We, Ln, Ym)] 


P7 demax(n,m) (ws ) , 
[w En k3" ^ (Wx, En, Ym)] = [w.ym kA" (We, Ln, ym.)]; 
[w, x4 k5"" (Wx, £n, z)] = [ws ko s (s Tn, Um)2]; 
[Ws ym k6" "^ (We, ym, z)] = [ws kr m41 (We, Tn, Um)2]. 


(The variables y and z and the functions h1”™ and k1”™-—k6”™ without index are 
variables and functions of sort AF £.) 


THEOREM 15 For any K-formula y, 
Q isa K-theorem iff  (PI-P7) —^ Y(IIg(q)) is a theorem in PLy, 
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By way of examples we illustrate how P1—P7 can be used during inference with 
a resolution-based theorem prover. Bernhard Nebel provided the following exam- 
ples. 


EXAMPLE 16 The set A = (O9O3T, OgUs.L, O11} is an inconsistent set of 
K-formulae. With theory resolution we can show the inconsistency in a single step. 
The Kg and the PLy formulations of O9O3T, O9U;.L and O; L are respectively 


(0)0(3)0T and -deo([]) ^ ^des([aox]), 
(0)Q[3]O.L | and — ^deo([]) ^ des([boy]), 
[1]O.L and  de([]). 


The set À is represented by the following set of clauses: 


Ci -dee([] C3 des([boy]) 
C5  -des([aox]) C4 de([l 


where x and y are variables and ag and bg are Skolem constants. Letting n = 0 
and m = 0 in P6 and using the substitution 


{Zo > Ao, Yo ^? bo, £ > k199 ([], ao, bo), y r1 k299 ([], ao, bo)) 


P6 simultaneously resolves with C1—C^ yielding the empty clause. 
The next example is more complicated. 


EXAMPLE 17 The set B = {O9O0O3T, 9999031, OoOiL, Git} of K- 
formulae is inconsistent. B contains the formulae of A (from Example 16) prefixed 
with a Oo and the formula O, L. The set of expressions in B is represented by the 
following clauses (derived as in the previous example via Kg and II; translations, 
which we omit): 


C, -dee([] Cs des([coz'doy']) 
Co -deo([aox]) Ce de, ([eoz"]) 
C3 -des([aoxboy]) C7 deı([]) 


C4 deg ([cox']) 


For the refutation we use P1 with n = 0 and P6 with n = 0 and m = 0. P1 can 
immediately be simplified with clause C4. The instances are: 


Pl' [fo((], 20, z)y] = [x02] 
P6' deo (w.), nde, (w.), [w.aok1™ (w., To, yo)] = [w..yo 290 (ws, To, yo)]. 
The result of simultaneously resolving P6', C4, and C7 with unifier {w, + {|} is 


Cs [ro k199 ([], zo, yo)] = [yo 299 ([], zo, yo)]. 
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Paramodulating with Cg and with unifier {xo > ao, œ> k1°°([], ao, yo)}, C3 
becomes (this means we do equality replacement with unification in C3 using the 
equation Cg) 


Cy  —des([yok29? ([], ao, yo) boy). 
This becomes 
Cio  ^des([rozboy]) 


when paramodulating with P1' using the unifier 


(yo MA fa ([]; zo, 2), v e k2% (D. ao, y0)}. 
We resolve P6' and C4 using unifier {w, +> [cox], z' =œ x} to get 


Cıı -de,((coz)), 
[coa k1^" ([cox], zo, yo)] = [coxyok2"" ([coz], £o, vo)]. 


Now, use the unifier 


[xo > co, z' ++ z — 2,25 + do, 
Yo m bo, y' r7 k199 ([cox], do, bo), y e k2% ([cox], do, bo)} 


and apply E-resolution to C's, C19 and C3; and get 
C12 ~de: ([coz]). 


(This means we resolve between Cs and Cio using an equation in C1.) Resolving 
this with Cg using E-resolution with Cg yields the empty clause. The unifier is 


[zo 9 eo, 2" + k1% ([], eoco), yo + co, 2” + k2% ((], egco)}. 
EXAMPLE 18 In this example we show 


(14) (OnpA Os ^ Donly ^ )) 9 Ont mtil¥ V V) 


is a theorem in K by showing that the following set of clauses is refutable. The set 
represents the negation of the theorem. 


Ci de([[ Cs deo([), ~p(lyoc)), ^v([voc]) 
Ca w([a«z]) Ce des mail) vn maid] 
C3 -des(l[l) C; dens, maa(l[l). Y ([rn-cm41 d]) 


C4 wV([bz']) 
Cs can be resolved with P3, letting n = 0 and m = n, and Ci yielding 


C; -—we([vyoc]), ^v([voc]). 
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This can be paramodulated using the equation in P1 and using the unifier 


{we + [yo fo (24.2). y > c}. 

The result is: 

C5 e(raz. —w([razp). 
Resolve this with C5 and get 

Cs | —wV([as.z]). 
Now, take Cs and paramodulate with P7 using the second equation and the unifier 

{ws > [zsm4i H En mall Tn, Ym), z > d} 

and obtain 

Co dentm+1(l), (rs KS" (i, £n, d)]), 


€max(n,m) , 


[r4 k3"""^ (T, En, ¥m)] = [ym k4" ^ ([], En, Ym)), 
[ymk6"™ ([], Ym, d)] = [E mai (ll, In, Ym)d]. 


The demax(n,m)([]) literal can be eliminated from Cs with either C1 or C3 in one 
resolution step. The clause 


Cg denim (ll) vs k5"" ([], Pn, d)]), 
[r4 k3"""^ ([], £n, Ym)] = [yum kA" (|, £n, y)]; 
[ys k6"7" ([], Ym, d)] — [Ema Tn, Ym)d] 


remains. Take C; and paramodulate with Cg and unifier 
{ln4+m+1 7 LOMAPIPRRA LE Tn; Ym) }- 
We obtain 


Cio des m4i((]), 79([2nk5"™((], £n, 4)]), =Y ([ymk6"™ ([], ym, d)]) 
[r4 k3"" (T, En, 9m)] = [ym k4""" ([], En, Ym)). 


Use Cz and C, to get rid of the ^q and the ~y literals. The unifier is 
{En > an, Um P bm, x e k5"" ([, an, d), x' + k6”™([], bm, d)]. 
Cio becomes 


Cu des 4m4a (Il) [ank3"™ ([], an, 55.)] = [bmk4”™ (|), an, 55, )]. 


This we can use to paramodulate with Cg. The unifier is 
{z +> k3"" ([. an, bm)} 


and the result is: 
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Cia. dent m4i([]), ^v [bo kA" ([], an, bm)))- 
Resolve this with C4 which yields 


Cis de, m 41 ([]). 


Now we use P6 and get 


C14 démax(n,m) ll); [ro k1" (|, En, 9m)] = [ym k27" (|, Fn, ym)]. 


Get rid of the demax(n,m)([]) literal by resolving with either C1 or C3. The equa- 
tion 


Ci, [2nk1°™([], xa, ym)] = [ymk2"™ (|, £n, Ym). 


remains. We use Cs again and paramodulate with Ci, substituting with 
{£n +> an, z+ k1"™((],2n, ym)} 


which leaves 


Cis 7 ([yYmk2”™ (D, an, ym)])- 


In the last step we resolve C15 and C4 with unifier 


{Ym HO br» k2"™ (|), tn, Ym)} 
to get the empty clause. 


In the functional translation we can prove instances of formulae with concrete 
values assigned to the n and m in the modal operators. There are examples of for- 
mulae for which the proofs with symbolic arithmetic terms instead of concrete val- 
ues work as well. However, this approach may not always work. The formula (5) 
below provides an example of a theorem which is true for all n and m (that satisfy 
the required restriction), but which can be proved in our system only for concrete 
instances of n and m. The situation may be worse. It may be the case that the proof 
of a formula for a particular concrete instance n depends on the instance of the for- 
mula for n — 1, and the proof of this instance depends on the formula for n — 2, 
and so on, to the formula for 0. Call this ‘induction on foot’. We now demonstrate 
a process of how a schema like (5) can be proved in our system by (ordinary) in- 
duction for all values followed by a translation step which yields a lemma we add 
to our theory. 

Suppose there are at least twenty objects in p and at least twenty objects in q and 
in all thirty objects exist. Then we expect the intersection of p and q to contain at 
least 10 objects. Our intuition is captured by the following formula 


(15) O4p ^ Og ^ D;a(o ^ v) m4 On4m41-j(9 V v) 
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withn +m + 1 — j > 0, if we letp=y,q=y,n = m = 19and j = 9. 
In Example 18 we showed (5) for the case that 7 = 0. Unfortunately there is no 
resolution-based proof for the general case. In the next theorem we use induction 
to prove (5). 


THEOREM 19 (5) is a theorem in K. 


Proof. The proof is by induction on 7. We proved the base case in Example 18. Let 
j)0. As induction hypothesis assume 


One ^ OqU ^U; 12(^e ^ V) 9 Ont m4i1—(j-1)(¥ V V) 


holds. Assume further Ony, Omy, and O;7(y A vy) hold. That Q;-(q ^ Y) holds 
implies (>0;-17(y ^ v) A Ojal ^ v)) V B-1^( ^ v) holds. 

Suppose 0,;_1-(y ^ v) holds. Then we apply the induction hypothesis. We get 
Onm41-(j-1)(9 V V) which implies, by A2, O44.,41—;(u V v) holds. 

For the second case assume 2D; 1 ( ^ Y) & O5-1(Y ^ v) holds. Let k = 
n +m, p =p Ay andy = y ^v in AIO. Then 


Ory ^ Only Av) > Oxk-a(q A ^w). 


From Ony, respectively O,,:, and 0;7(y~ ^ Y) we infer that O4. .;(w ^ ^y), re- 
spectively O4, .;(^« ^ v), holds. Hence, by A12, 


Om-n41—j-1((9 ^ ^V) V (^w ^ V)) 


holds. Using A12 again, this time applied to the formulae 


Om+nti-j-1((P ^ ^V) Vp ^v)) and O;i(p^w), 
we conclude Om4n+1—j(¢ V Y) holds. This proves the theorem. a 


__ The next result shows we can replace the axiom N8 in Kz by the corresponding 
K -formulation of the formula (5). 


THEOREM 20 Axiom N8 of Kg can be replaced by 
(16) (n)\Op ^ (m)Oy ^ [j] Only ^ v) 2 (n m +1- j)G(e v v) 
forn+m+1—j2>0. 


Proof. In Theorem 19 we proved (5), its K-formulation, is a theorem in K. Thus, by 
Theorem 6, (20) holds in Kg. It remains to show (20) implies N8. This is immediate 
if we let 7 = 0 and substitute y A w for y and y A ^v for exploiting []|O T + T 
(N4). a 
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Although replacing N8 with (20) does not increase the number of provable formu- 
lae, we avoid the induction argument necessary for proving (20) which we would 
have to provide by hand as we don’t have an induction theorem prover at our dis- 
posal. Also, we avoid proving instances of (20). 

The functional translation of (20) into predicate logic is somewhat more com- 
plicated than that of N8. It is given by 


Vu Vz [[(^des(z) ^ dy:n Vó:E p(1([75],2))) ^ 
(^de (x) ^ 3y:m Vó6:E v(V([yó], x))) ^ 
(^de;(z) + Vy:j 36:E ^(ve(1(6], )) ^ v(1(9], 2))))] > 
[(^den4-m41—;(z) ^ 
Fy:ntm+1-j Vó:E (o(4([74], z)) v Y (Hivd), 2))))]. 


Like II; (N7) this formula cannot be reduced to a first-order formula. We swap the 
quantifiers 3»y:n--m41-7 and Vó: E. The quantification elimination algorithm SCAN 
produces then for this input the following clauses: 


P8 demax(n,m) (ws), ^den 4 m41- j(w.), l 
[wa f 77^ (Wa, En, £ )u] = [Wr En f5°™ (We, 4, u)] 


P9 demax(n, m) (wx), ^7deénm41-j (w.), 
[wa f 77^? (Wa, En, 1$ )u] = [Wim fO"™ (Ws, Lm, u)] 
P10 demax(n,m)(W«), ^de;(w.), 
[w. Aus (ws, U, z4)v] = [win f 2^? (Ws, v ; £n], 
[ws f35 i-i (ws ;U, Lm)v] = [wm f 4^7? (w., v, Im)| 
P11 demax(n,m) (wx), 
[w fij m4i-j (wx, V, £4)v] = [ws tn f 2^7? (ws, U, Tn)], 
[w. f3ntm+1-j (ws, v ; $£q«)v] — [Wa 2m f"? (ws, U, Tm), 
[wa f7; (Wa, En, Lm)U] = [w £n f5 (We, £n, u)] 
P12 de max(n,m) (ws), 
[ws f 152 1-5 (WV, £4)v] = [ws tn f 2^? (w, , v, £n)], 
[w. f3ntm+l-j (Wa, v, zm)v] = [Wim f4 (We, v, £m), 
[w, f 7572 (Ws, En, zm)u]- [Wm f 6^7 (wr, x4, ,u)] 
together with the clause 
(17) de,(w), dem(w), ^de;j(w), ^den4m41-;(w), 


which is implicit in P1—P7. We can show that, for any positive integers n, m and 7, 


Jk l (k,l) € {n,m} x {j,n+m+1-j} suchthat k> l. 
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For, suppose not. Suppose n, m and j exist such that for any k and l with (k,l) € 
{n,m} x {7,n+m+1-j} we have k(l. Then, n(j, m(j and n(n 4- m - 1 — j. Hence, 
j(m + 1, and thus, m(j (m + 1, which cannot be for 7 a positive integer. 

If the values n, m and 7 are such that we can choose k and l with k strictly larger 
than / then (5) is subsumed by P3. Otherwise, if the values are such that we can 
choose identical k and /, then (5) is a tautology. In either case (5) is redundant. 

We conclude this section with an example (supplied to us by Werner Nutt) in 
which we exhibit the computational effect of using the clauses P8—P12. 


EXAMPLE 21 Suppose the universe consists of at most thirty objects. If there are 
at least twenty objects in p and there are at least twenty objects in q, then there are 
at least ten objects in p ^ q. A standard tableaux system for the number operators 
would generate twenty witnesses for p, twenty witnesses for q and then it would 
need to identify ten of them in order not to exceed the limit of thirty. But there are 
combinatorically many ways for identifying ten of them. 

In our system we prove the conjecture by showing the following set of K-formulae 
is inconsistent: 


{Oi9p, C194, Uao.L, Dg (p^ gq) }. 


We can choose any other suitable combination of numbers. This will not change the 
structure of the proof at all. The translation into PLy is: 


{-de19([]) ^ p([ai9z]), ^deio(f]) ^ a([b192]), deao([]); 
deg([]) V ^»([yoc]) V ^a([yoc])]. 


The corresponding set of clauses consists of: 


Ci -deio(ll) C4 deso([]) 
C2 p([aoz]) Cs deg([), ^P([usc]), ^a(lvoc]) 
Cs  q([b1oy]) 


Resolve Cs with P3 and C, and eliminate the deg((]) literal from Cs leaving: 


Ci -—P([voc]), ^a(lyoc]) 


We resolve the instance of P9 with n = m = 19, 7 = 9, namely 


P9' deis ([]), -deso(T]), 
[f79°*° (J, T19, T19)U] = [r1 f 61? 19 *([] 119, u)], 


with C and C4 and obtain 
Cs [S73 19 *([], 119, t1g)u] = [zio f 61? 19 (i, Tig» u)]. 


Applying the unifier (yg +> f719!9 9([], x19, x19), u 9 c}, we can use this ina 
paramodulation step with C resulting in 
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Cr -p([rigf61919 ([, x15, c)]), ^a([£ 739 19 ?([], 219, x19)c]) 


Unify in Cz and Cz with {x19 +> aig, z +> f619!? *([], x15, c)}. Resolving C» and 
Cy yields 


Cg q([f 7$? 1? ?([], £19, a19)c]). 


Now we use the following instance of P8: 


P8' deig(w,), —deso(w.), 
[w f 7$? 19 °(ws, T19, Lig) tl = [wsz f 5!? 19 (ws, T19, u)] 


This can be reduced with C1 and C4 to the equation 
Co [79°19 9([], x19, z1g)u] = [119/51? 1? *([], x19, u)], 


which we can now use in a paramodulation step with Cg. We get 


Cio ^q([z1o f 51? 1? 9([], 219, c)]). 


The empty clause is obtained if we resolve Cio with C3 using the appropriate uni- 


fier. 


6 FROM CONCEPT DESCRIPTION LANGUAGES TO GRADED 
MODALITIES 


A knowledge representation system in the KL-ONE-style [4] usually consists of a so 
called T-Box and an A-Box [5]. The T-Box axiomatizes the part of the world that is 
to be modelled in the system whereas the A-Box is more or less a classical database 
containing information, in general ground facts, about the actual situation. 

Most T-Box (or terminological) languages have as syntactic primitives concept 
names and rule names. Concept names denote sets of objects and role names denote 
binary relations between these objects. Using concept forming connectives, like ^, 

N, Li, some and all, compound concept terms can be built which also denote sets 
of objects. 

A prototypical concept description language is the ACC language (short for ‘at- 
tributive concept description language with complement’). It has a well- 
defined model-theoretic semantics and its computational behaviour is completely 
understood. The terminological language of ALC uses only the concept-forming 
operators ^, M, U with the usual meaning (complement, union, intersection) as 
well as role quantifications (all R C) and (some R C). (all R C) denotes the set of 
all objects whose R-successors (R-fillers in the KL-ONE terminology) are all in C. 
(some R C) denotes the set of all objects with some R-successor in C. Typical ex- 
amples for concept definitions in ALC are: 
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man = person M (some sex male) 
parent = person M (some child T) 
father = parent M man 
grandfather = father M (some child parent) 
woman = person f1 man 


T denotes the set of all objects. 

Given a set T' of concept equations, a concept C is coherent if there is a model 
for T' in which C denotes a nonempty set. Furthermore, a concept description C 
subsumes a concept description D in T, if C denotes in every model of T a super- 
set of D. Deciding coherence and subsumption is the basic reasoning service of the 
knowledge representation systems based on ALC. According to the above defini- 
tions, for example, it is possible to infer that grandfathers are fathers and persons 
and men as well, i.e. man subsumes father and grandfather. 

In [28], Schmidt-Schau8 and Smolka show that deciding coherence and sub- 
sumption of concept descriptions is P-SPACE-complete and can be decided with 
linear space. Many variants and extensions of ALC have now been investigated 
[19, 20, 21, 27, 9, 8, 35] and are used in implementations of knowledge 
representation languages [3]. We focus on a language very much like ACCN which 
includes numerical quantification operators atleast and atmost. For example, the 
concept term (atleast 3 has-child blond) represents the set of individuals who have 
at least three children who are blond. The term (atmost 2 )has-parent T represents 
the set of individuals who have at most two parents. The language we consider is 
slightly more expressive than ACCN. Our version, referred to as ACC * , allows 
for arbitrary concepts to be included in other concepts, whereas in ACCN only 
atomic concepts can be included in other concepts. 

Now, we define the syntax of ACCN +. The signature of the terminological lan- 
guage of ACCN* consists of a set Uz of role names and a disjoint set Sc of con- 
cept names. From role names Q € Lp and concept names A € Xc compound 
concept terms C are formed according to the following rules: 


CD — A|-5C|Cr1iD|CuU D |(some R C)| (all R C)| 
(atleast n R C) | (atmost n R OJCCD|IC- D. 


n is a non-negative integer. Most authors define the symbols E and = to be sen- 
tential symbols. We define them to be connectives just as M and LI! are. Note, we 
consider terminological sentences of the form C C D and C = D to be concept 
terms. In ACCN terminological sentences are constrained to be of the form A C C 
and A = C, where A are concept names. A T-Box is defined as a set of concept 
terms. 

The semantics of ACC * is specified by an interpretation Z = (U,V) with U a 
non-empty set U (the domain of interpretation) and a signature assignment V. The 
signature assignment maps role names to binary relations on U and it maps concept 
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names to subsets of U. The interpretation of concept terms C and D specified by: 


C! = V(C) if C is a concept name 
(nc)? =U\C? 
(CND)? =c* nD? 
(Cu D)? = C7 u D 
(CED)? = (U \ C?) u D? 
(C =D)? = (U \(C? UD*)) u (C7 n D?)) 
(some R O* = (ze U | 3y € U R? (z,y) Ay € C7} 
(all R OF = {x € U|VyeUR'(z,y) Bye C7} 
(atleast n R C) = (z € U | {y € C7 | R7 (x,y)}| 2 n) 
(atmost n R C) = {z € U | |{y € C7 |R7(z,y)}| € n) 


Atomic concept names in a T-Box T are interpreted as the entire domain and are 
all equivalent to the top concept T. T is the largest element in the subsumption 
ordering. The complement of T is L and represents the empty set. 

An interpretation Z = (U, V) with C? = U for all concept terms C in the T-Box 
T is a model of T. A concept term C is universal iff C? = U for all interpretations 
T. Cis empty iff C! = @ for all interpretations T. 

The entailment relation }= between concept terms is defined by: 


CED iff D? =U for every interpretation T of C. 


Then C F- D iff C C D is universal iff C N 2D is empty. 

We treat sets {C;,..., Cn } of concept terms in the same way as the conjunction 
C, T1... N Cn. Thus, a given T-Box T will be treated as the conjunction of its 
elements. 

In contrast to other terminological languages the language ACC * includes no 
role-forming operators. Roles that occur are all atomic. To simplify our presenta- 
tion, without loss of generality we assume there is one atomic role R. 

Now we show that we can embed ALCN in K. Define a mapping II from the 
language of ACCN to the language of K by: 


IC) = C  ifCisaconcept name or T or L 
(ac) = -—II(C) 

I(CnD) = MT(C)AT(D) 
II(CUD) = II(C) VII(D) 
II(CCD) = II(C)—TI(D) 

II(some RC)) = OgII(C) 
II(all RC) =  GygII(C) 

II(atleastn RC) = 9, 4II(C) 


II(atmost n R C) O,-11(C) 
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It is easy to verify that IT is well-defined. The following is the main statement of 
this section. 


THEOREM 22 (Soundness and Completeness of IT) 


A concept term C is universal iff II(C) is a tautology. 


Proof. Let Z = (U,V) be any interpretation of a T-Box of ACCN*. Let M be 
the modal model (U, R7, V). By induction on the structure of C prove, for every 
rcU: 


zec? iff M,z E II(C). 


We omit the details. B 


7 CONCLUSION 


In the logic of graded modalities it is possible to express properties of finite sets. 
The usual inference calculi for this logic generate for all sets used in the proof at 
least as many constants (witnesses) as the cardinality of each set. Even for moder- 
ate values a vast number of witnesses are generated which are processed by case 
distinctions in the proof. 

In this paper we present an alternative method which avoids case distinctions, 
instead our method uses limited arithmetical reasoning. It arises in a series of trans- 
formation steps. First, we translate the logic of graded modalities K into a new nor- 
mal multi-modal logic, called Kg. Unfortunately, K g does not reduce by the stan- 
dard relational translation to first-order logic. One of the axioms of Kg is second- 
order. We solved this irreducibility problem by, instead of using the relational trans- 
lation, using a functional translation with a particular optimization which exploits 
the richer structure of the functional models. 

Our method can also be applied in the field of knowledge representation. The 
terminological logic ACCN is closely related to the graded modal logic K. In fact, 
there is an exact correspondence between terminological operators and modal oper- 
ators. Our approach provides a viable alternative inference mechanism to the con- 
straint algorithms commonly used, which also suffer from the overhead of evalu- 
ating case distinctions. 

Our approach must be viewed as a first step toward efficient reasoning with finite 
sets. There are a number of open problems which need to be addressed. 


(i) A general completeness result for Kg would allow us to use the full expres- 
sivity of this system. As long as this is not proved, we can guarantee com- 
pleteness only for the original K formulae. This is what we wanted from the 
beginning, but a stronger result would be preferable. 
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(ii) Our first-order theory is represented by a set of axiom schemas which are 
understood to be conjunctions of all its instances with the numerical variables 
instantiated with concrete values. The implementation of the calculus will 
rely on theory resolution. The axiom schemas will be encoded as inference 
rules. Since the axiom schemas contain equations the realization will not be 
easy, but it is certainly solvable. 

(iii) The original logic of graded modalities is decidable. Accordingly, we expect 
a resolution strategy for the translated formulae can be developed that is com- 
plete and terminates. This has yet to be done. 

(iv) Our calculus is still limited in reasoning with arithmetical terms. It remains 
to be investigated whether and how this capability can be enhanced. 

(v) We have applied our methods to KL-ONE-type reasoning but only for rea- 
soning within the T-Box. This corresponds directly to that in modal logic. We 
haven't accounted for A-Box reasoning about concrete instantiations of con- 
cepts/sets and roles/relations. The functional translation applied to A-Box 
terms generates many equations. It is not immediate how these can be treated 
efficiently. 

(vi) The correspondence properties for the axioms of Kg except one are first- 
order. This does not rule out that Kz is complete with respect to a first-order 
model theory. If this were the case we can get a translation into predicate 
logic that avoids some equational reasoning. 
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FROM CLASSICAL TO NORMAL MODAL LOGICS 


1 INTRODUCTION 


Classical modal logics (Segerberg [27], Chellas [2]) are weaker than the well- 
known normal modal logics: The only rule that is common to all classical modal 


logics is 
RE: F&G 
OF + OG 


(We nevertheless note that this principle raises problems in systems containing equal- 


ity (Hughes and Cresswell [14]).) 
Classical modal logics do not necessarily validate 


RN: F 
OF 
RM: FG 


OF + OG 
C: (OF AQG) > O(F AG) 
K : (OF AO(F > G)) > OG 


which are valid in any normal modal logic. ! 
In an epistemic reading, adopting one of the above formulas as an axiom means 


to close knowledge under a particular principle. Precisely, RE corresponds to the 
principle of knowledge closure under logical equivalences, RN under logical truth, 


‘Under RE, the principles RN, RM, C, and K are respectively equivalent to 
N: OT 
M: O(F AG) 2 (QF AQG) 
C': (OF AO(F + G)) => OG 
K' : (QF A OG) > O(F AG) 
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RM under logical consequence, C under conjunction, and K under material im- 
plication. | 

These principles, in an epistemic reading, as well as in a deontic one, are not al- 
ways desirable (see e.g. (Fagin and Halpern [3]), (Jones and Pórn [16, 17])). In fact, 
each closure principle expresses an aspect of what has been called the omniscience 
problem. 

Each of these principles can be adopted independently of the others. Neverthe- 
less, under RE and RM, C is equivalent to K, and under RE and RN, K is equiv- 
alent to RM and C. 


Classical modal logics have possible worlds semantics in terms of minimal mod- 
els (cf. Scott-Montague structures in (Montague [20])), involving a neighbourhood 
function mapping worlds to sets of sets of worlds. Now each of the above closure 
principles identifies a particular class of minimal models. For several of these prin- 
ciples, specialisations of minimal models have been given such as augmented min- 
imal models, models with queer worlds or models with inaccessible worlds. They 
are more tailored to these principles and give a better account of the logic associated 
to the principle. 

While standard Kripke semantics can be translated straightforwardly into 
classical first-order logic, this is a priori not the case of the above modal logics. 
The reason is that the neighbourhood function of minimal models cannot be repre- 
sented directly by a first-order formula. 

In this paper (section 4) we show that nevertheless semantics in terms of mini- 
mal models can be expressed in first-order logic. We show this indirectly: What we 
prove is that minimal models can be translated into standard Kripke models (which 
on their turn can be translated into first-order logic). 

A major advantage of such translations - and in fact our main motivation - is that 
they allow to reuse the proof systems that have been developed for normal modal 
logics. 


In the rest of this paper we first briefly present several classes of minimal models 
and their simplified semantics (section 3). Then for each class we present a trans- 
lation into normal multimodal logics (section 4). 


2 GENERAL POINTS 


2.1 Language 


The language of modal logic is built on a set of propositional variables, classical 
connectives and a modal operator O. F, G, and H denote formulas, and T and L 
respectively stand for logical truth and falsehood. © F is an abreviation of ^F. 
FOR denotes the set of formulas. 
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2.2 Semantics 


Semantics is stated in terms of frames and models. Generally, a frame is composed 
of a set W (whose elements are called worlds) and some structure S on W. In the 
well-known case of K -frames (or Kripke frames), the structure is just some binary 
relation r on W:r C W xW.? Then a model is composed of a frame and a meaning 
function m mapping propositional variables to sets of worlds. 

Given a model, it is the truth conditions which uniquely determine a ternary forc- 
ing relation between models, worlds, and formulas. In the case of propositional 
variables and classical connectives, the truth conditions are the usual ones: 


e M,w E- F iff w € m(F) if F is a propositional variable. 
e MwEFAGiff M,w E Fand M,w E- G. 
e M,w [- aF iff not(M,w E F). 


The structure of W is exploited when it comes to the truth condition for the modal 
operator. In the case of standard Kripke semantics, the forcing relation must satisfy 
the following one: 


e M,w [- OF iff for all v € r(w), M,v E F. 


Structures that are richer have more complex truth conditions. In order to avoid 
confusions we shall add a superscript to [- designating the type of the non-normal 
semantics. 

Generally, in a given semantics, we say that a formula F is true ina model M = 
(W, S,m) if M,w [- F for every w € W. A formula F is true in a frame (W, S) 
if for every every meaning function m, F is true in (W, S, m). A formula F is valid 
in a class of frames F (noted =p F) if F is true in every frame of F. 

A particular semantics will always be identified by a condition on the structure 
type together with the truth condition for the modal operator. An example in stan- 
dard Kripke semantics is condition t : w € r(w) for every w € W. (As in (Chellas 
[2]), conditions on structures are denoted by small letters.) 

We confuse conditions and the class of frames satisfying them: k being the case 
of standard Kripke models, the class of standard Kripke frames satisfying condition 
t is called k + ¢ (or kt for short). 


2.3  Axiomatics 


When a class of frames can be characterized by some axiom system we denote the 
latter by the corresonding capital letters. E.g. the basic normal modal logic is ealled 
K,and x F expresses that F is a theorem of K. The class of frames kt being 


*We shall often write w’ € r(w) instead of (w, w') € r. 
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characterized by the T-axiom OF — F', the corresponding logic is called K + T 
(or KT for short). 


2.4 Multimodal Logics 


Multimodal languages generalize modal languages by allowing indexed modal op- 
erators. The target logics of our translations being bi- or trimodal logics, what we 
need is a language containing three modal operators [1], [2], [3]. 

The multimodal logics we need are normal ones having a standard Kripke se- 
mantics. We index every accessibility relation and condition on it by the number of 
the corresponding modal operator. E.g. t[1] expresses that the accessibility relation 
rı is reflexive. As before, classes of multimodal frames are refered to by sums of 
conditions. k[1] + k[2] denotes the class of bimodal frames, and k[1] + k[2] + k[3] 
the class of trimodal frames. (Sometimes we write k[1, 2] and k[1, 2, 3] for short.) 

On the axiomatical side, we index axioms and inference rules by the correspond- 
ing number. E.g. K[1] is the axiom ([1]F A [1](F — G)) — [1]G, and D[2] is 
[2] — (2)F. As before, multimodal systems are referred to by sums of axiom or 
inference rule names. Our basic normal bimodal logic is K [1] + K [2], and the basic 
trimodal one is K[1] + K[2] + K[3] (X[1, 2] and K [1, 2, 3] for short). 


3 MINIMAL FRAMES AND THEIR CHILDREN 


In this section we present the class of minimal frames as well as three subclasses of 
it: supplemented minimal frames, quasi-filters, and singleton minimal frames. (The 
last subclass corresponds to Humberstone's inaccessible worlds logic.) 


3.1 Minimal Frames 


Minimal frames are the basic semantical tool for classical modal logics. 
A minimal frame (Chellas [2]) is a couple (W, N), where 


e W isaset of worlds, and 
e N:W > 2” maps worlds to sets of sets of worlds. 


Sets of worlds being usually called propositions, we may also say that N associates 
a set of propositions to every world. N (w) is sometimes called the neighbourhood 
of w. We recall that then a minimal model is a triple (W, N, m) where (W, N) isa 
minimal frame, and m is a meaning function. 

The forcing relation [-"'*^ results from the following truth condition for the 
modal connective: 


e M,w [-""" OF iff there is V € N(w) such that (v € V iff M,v K™” F) 
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Hence OF is true in a world if its neighbourhood contains the extension of F. 

The class of minimal frames is noted e. Subclasses of e can be obtained by adding 
conditions on the mapping N. An example of a condition is t: w € V for all V € 
N (w) and w € W. The class of minimal frames satisfying condition t is called 
e + t, or et for short. 

The class of minimal frames e can be characterized by some axiom system for 
classical propositional logic plus the inference rule RE (Chellas [2]). This basic 
classical modal logic is called E. 

The class of frames et is characterized by the T-axiom OF — F. As expected, 
the corresponding logic is called E + T, or ET for short. 


REMARK 1 There is an isomorphic form of minimal frames (see e.g. (Fitting [5])), 
where there is a set of accessibility relations instead of the neighbourhood function 
N. In order to account for worlds with empty neighbourhoods, just as in regular 
modal logics the concept of a queer world is used. ? 

Formally, Fitting's frames are triples (W, Q, R), where W and Q are sets of 
worlds such that Q C W and R is a set of relations on W. Q is called the set of 
queer worlds. Here, the forcing relation E-*"^ must fulfil the following truth con- 
dition for the modal operator: 


e M,w | OF iffw € Q, andthereisr € Rsuchthatv € r(w)iff Mv EE F 


Hence in a queer world all formulas or the form OF are false. 


3.2 Singleton Minimal Frames 


Singleton minimal frames correspond to inaccessible world frames that were intro- 
duced and axiomatized in (Humberstone ([15]). 

A singleton minimal frame is a minimal frame, where for every world w, the set 
N (w) is a singleton. In other words, there is exactly one propostion that is necessary 
in a given world. 

The only principle of our list that supplemented minimal frames validate is that 
of closure under conjunction: 


C : (OF AOG) 5 O(F AG) 


There are countermodels for RN, RM, and K. 

Clearly, singleton minimal frames are isomorphic to standard Kripke frames of 
the form (W, r). Hence the only difference from the standard Kripke semantics is 
the modal truth condition: 


e M,w E*" OF iff (v € r(w) iff M,v EF” F) 


3Note that although Fitting uses these frames only for monotonic modal logics, they can 
be used as well to give semantics to classical modal logics. 
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Note that RE and C are not enough to completely characterize singleton mini- 
mal frames, and that Humberstone’s ([15]) axiomatization is infinitary. This logic 
has also been studied formally in (Goranko [11], Goranko and Passy [13]). The con- 
cept of inaccessible worlds is fundamental in the logics of knowledge and belief of 
Levesque ([19]). There, OF is read “I only know F”. 


3.3 Supplemented Minimal Frames 


Supplemented minimal frames are at the base of monotonic modal logics. 

A supplemented minimal frame is a minimal frame where for every world w, the 
set N (w) is closed under supersets. * 

Supplemented minimal frames can be characterized by the principle of closure 
under logical consequences: 


RM: FG 
OF + OG 


(from which RE can be derived). There are countermodels for RN, C, and K. 
Modal logics containing RE and RM are called monotonic, and the basic mono- 
tonic logic is called E M. 

There is a slightly different semantics where the closure under supersets is im- 
plicit in the truth condition. It has been used e.g. in Fitting's ([5]) underlying logic 
U. There, frames are just the minimal ones. It is in the forcing relation (that we 
note [-5"?) where the truth condition for the modal operator is different from that 
of classical modal logics: 


e M,w |°"? OF iff there is V € N(w) such that (v € V implies M, v [-5"? 
F) 


In other words, we can simulate superset closure by replacing “iff” by “implies” in 
the truth conditions. 

For the same semantics, the terms local reasoning frames and logic of local rea- 
soning have been employed by Fagin and Halpern ([3]). They have employed these 
logics to model implicit and explicit belief: Explicit implies implicit belief, and the 
operator of explicit belief has a monotonic modal logic, whereas that of implicit 
belief has a normal modal logic. 


3.4  Quasi-Filters 


Quasi-filters are the base of regular modal logics. 

A supplemented minimal frame is a quasi-filter if N (w) is closed under super- 
sets and under finite intersection, for every world w. Closure under finite intersec- 
tion can be strengthened to closure under arbitrary intersections (v. (Chellas [2]), p. 


* An equivalent condition is: If U N V € N(w) then U € N(w) and V € N(w). 
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255). Hence quasi-filters are isomorphic to local reasoning frames where for every 
world w, the set N (w) is either a singleton or the empty set. 

Such a presentation can be specialised to the following one in terms of 
augmented Kripke frames (Fitting [5], [6]). An augmented Kripke frame? is a 
quadruple (W, Q,r,m) where 


e W is aset of worlds, 
e Q C W isa set of worlds (called queer worlds), and 
e r C W? is arelation on W. 


Thus queer worlds correspond to worlds in minimal frames having an empty neigh- 
bourhood. In this semantics the truth condition for the modal connective is: 


e M,w [-?"9 OF iff (w € Q and for every v € r(w) : M,v H°" F) 


We are quite close to standard Kripke semantics here, except that there are some 
worlds that do not satisfy any formula of the form OF’. 

Augmented Kripke frames can be characterized (Fitting [5]) by the principles of 
closure under logical consequences and material implication: 


RM: FG 
OF + OG 


K: (QFAQ(F > G)) 9 OG 


They can also be characterized by RM and C (closure under conjunction). 
There are countermodels for RN (closure under logical truth). 

Modal logics containing RM and C are called regular, and the basic monotonic 
modal logic is called EMC. 


4 TRANSLATIONS 


In this section, for each class of classical modal logics that we have presented we 
give a translation into normal multimodal logics. 

Monotonic, regular, and inaccessible world modal logics are translated into bi- 
modal logics, whereas classical modal logics are translated into a trimodal modal 
logic. As the latter translation combines the principles of the previous ones, we give 
it at the end. 


5 Augmented Kripke frames should not be confused with augmented minimal frames 
(Chellas [2]) which are minimal frames that are isomorphic to standard Kripke frames. 
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4.1 Monotonic Modal Logics 
The Translation 


The idea of the translation is to give a world status to those propositions that are 
associated to worlds via the neighbourhood function N: Every element of N (w) is 
viewed as a world accessible from w. Thus “there isa V € N(w)” can be expressed 
via some existential modal operator (1). The truth of a formula in every element of 
V can then be expressed via a second universal modal operator [2]. In this way we 
can transform a minimal model into a standard Kripke model, and the logic of (1) 
and [2] is a normal one. 

Formally we define a translation 7 from monotonic modal logics into normal 
multi-modal logics as follows: 


e T(F) = F if F isa propositional variable 
e (OF) = (1)[2]r (F) 


and homomorphic for the cases of the classical connectives. 
The same translation has been given independently by F. Wolter ([28]) in order 
to prove the completeness of a large class of monotonic modal logics. 


THEOREM 2 Fem F iff Fata} T(F). 


Hence by soundness and completeness of EM and K[1, 2] we also have that a 
formula F is a theorem of basic monotonic modal logic E M if and only if T(F) is 
a theorem of normal bimodal logic K [1, 2]. 


REMARK 3 We can also translate the operator L of implicit belief of (Fagin and 
Halpern [3]) by adding the supplementary case T(LF) = [1][2]r(F). 


REMARK 4 At least for the basic monotonic modal logic EM we are able to 
strengthen our translation to (GF) = (1)1|r(F) and thus to translate into mono- 
modal logic. 


In the next theorem we give a more general result for those monotonic modal 
logics that are characterized by axioms among the following standard ones: 


D: OF — OF 
T: OF ~F 
4: OF > OOF 


The systems EM D, EMT, EM4, EM D4, EMT 4are complete (Chellas [2]). We 
denote by emd, emt, em4, emd4, emt4 the corresponding classes of frames. Com- 
pleteness of these logics is used in the following theorem: 
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THEOREM 5 Let «K be any sum of conditions on minimal frames among d,t, 4. 
Let A be the corresponding combination of axioms D,T,4. Then Eem4« F iff 


F kn,aper(A) TF). 


Hence using the completeness of EM D, EMT, EM4, EM D4, EMT 4wecan 
prove now theorems of these monotonic modal logics via our translation into par- 
ticular multi-modal logics. 

Note that the translations (D), 7(T), 7(4) of the standard modal axioms D, T, 
and 4 become multi-modal axioms in the style of Sahlqvist [26] for which 
completeness results (Catach [1], Kracht [18]) and automated deduction methods 
(Ohlbach [23], [24], Gasquet [7], [8], Nonnengart [21], Farifias and Herzig [4]) are 
known. Note also that the translations of the modal axioms 5 and B would become 
axioms which have not been studied yet in the literature. 


Examples 


EXAMPLE 6 The formula (Op ^ Oq) — O(p ^ q) is translated into 

((1)[2]p ^ (1)[2]a) ^ (1)[2](p ^ a). 
which can be proved neither in EM nor in EMD,EMT,EM4, EM D4, 
EMTA. 


Proofs 


We do not give the proofs, because the technique is a particular case of that for 
classical modal logics (cf. subsection 4.4). 


4.2 Regular Modal Logics 


The Translation 


Having at our disposal the preceding translation for monotonic modal logics, an im- 
mediate way to translate regular modal logics is to prove that Fem. F iff 
F k[1,3]-(c) T(F), where C is the axiom of closure under conjunction, and 7 is 
the same as in subsection 4.1. Nevertheless, 

T(C) = ((1)[2]7(F) ^ (1)[2]r(G)) —> (1)[2](7(F) ^ T(G))is a rather complex 
axiom. Here we give an optimization by associating a special axiom just to the first 
modal operator [1]. This will also permit to state a more general exactness theorem. 

What we shall prove is that the translation is exact when the modal logics for [1] 
and [2] are normal, and moreover that for [1] satisfies an axiom that we call T; (the 
converse of the standard 7'-axiom): 


T,: F > [1]F 


Semantically, this corresponds to the condition te that the accessibility relation rı 
is a subset of the diagonal of W: rı C ôw. (In other words, for every world w, 
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r1 (w) is either a singleton w or the empty set). The idea behind this axiom T. is 
that, in our proof, queer worlds will not be in rı, while OA will be translated into 
(1)[2] A; queer worlds will correspond to worlds without r; -successors and hence 
will invalidate (1)[2]A. 

Clearly, 7(C) is a theorem of K [1, 2] + T,[1]. In the following theorem we give 
a general result for systems of regular modal logics: 


THEOREM 7 Let k be any semantical condition on accessibility relations r of aug- 
mented Kripke frames. Let (1, 2) be the translation of k on accessibility relations 
rı and rz of standard Kripke frames (u d Q being translated into (u,v) € rı and 
(u,v) €r into (u,v) € r». 

Then Feme+n F iff Fkp,2) 4) []4«(2) TF). 


Hence if « can be characterized by an axiom A and «(1, 2) by a bimodal axiom 
A’, then we can prove theorems of regular modal logic EMC + A by proving 
theorems of the normal bimodal logic K[1, 2] + T,[1] + A’. 


THEOREM 8 Let k be any semantical condition on accessibility relations r of aug- 
mented Kripke frames that is characterized by some axiom A. Let &(1,2) be as in 
the previous theorem. Then Femc+n F iff koe pera) T(F). 


Examples 


EXAMPLE 9 The formula7(C) =(QpAQgq) > O(pAq) is translated into 7(C) = 
((1)[2]p A (1)[2]q) ^ (1)[2](p ^ a). 

As ((L)F ^ (1)G) ^ (1)(F A G) can be proved in K[1, 2] + T,[1], the formula 
T(C) is a theorem. 


Proofs 


LEMMA 10 For augmented Kripke models of the form Maug = (W,Q,r,m) let 
f be a mapping such that f (Maug) = (W, Ri, R2, m) where 


e Ri = dg = {(w,w) : w ¢ Q}, 
e Ro =R. 


Then f is an isomorphism between augmented Kripke models and standard 
Kripke K[1,2] + T.[1]-models. 


Proof. It is easy to see that for every augmented Kripke model Maug, f (Maug) isa 
K[1, 2] 4- T,[1]-model, and Maug, w [-^"9 F iff M, w E- T(F), for every w € W. 

In the other sense, for every K[1, 2] + T.[{1]-model M, we have that f^! (M) is 
defined and is an augmented Kripke model, and M,w EL 7(F) iff Maug, 
w E*"3 F, for every w € W. E 
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Thus we also have an isomorphism between EM + A-models and K[1, 2] + 
T«[1] + 7(.A)-models. Theorem 7 follows immediately from that. 


4.3 | Humberstone's Inaccessible Worlds Logic 
The Translation 


The idea of the translation has been given in (Goranko [12]). It is the following: 
The truth condition is equivalent to 


e Miw, w E-*" OF iff for all v € W 


- ifv € R(w) then Mi, v E-*" F, and 
- ifv € R(w) then Mi, v KY AF. 


where R denote the complement of R. What we do is to introduce two modal con- 
nectives, [1] to access R-successors, and [2] to access R-successors. This leads to 
the following translation 7 from logics with inaccessible worlds into normal bi- 
modal logics: 


e T(F) = F if F is a propositional variable 
e 7(OF) = [1]r(F) ^ [2]^7(F) 


and homomorphic for the cases of the classical connectives. 
In order to get an exact translation, we must choose KT5[1 U 2] ê as our target 
logic: 


THEOREM 11 E™ F iff - xrspus] T(F). 


Hence F is valid in Humberstone's inaccessible worlds logic iff 7() is valid in 
normal bimodal logic KT5[1 U 2]. 


REMARK 12 Contrarily to what could be expected, our target logic is not required 
to have accessibility relations for |1] and [2] with an empty intersection. In fact, it is 
sufficient to prove that K T'5|1 U 2] is characterized by the frames where: Ry U Ro 
is an equivalence relation and R4 N Rg = Ø. (This key lemma is given in the next 
section.) 


$By KT5[1 U 2] we mean a normal multi-modal logic where both modal connectives 
[1] and [2] have the axioms of modal logic K, plus the axioms T and 5, stated for the modal 
operator [1 U 2]. ([1 U 2] F is an abreviation for [1]F ^ [2] F.) In other words, KT'5[1 U 2] 
is axiomatized by some axiomatization of classical logic, necessitation rules for [1] and [2], 
plus the axioms [1 U 2]F — F and -[1 U 2]F — [1 U 2]-[1 U 2]F. It is well-known that 
KT5[1 U 2] is characterized by the class of Kripke frames (W, Ri, Ro) where R1 U Ro is 
an equivalence relation over W (see e.g. (Catach [1])). 
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REMARK 13 The axiomatization given in (Humberstone [15]) is infinitary. On 
the contrary, the above translation provides an indirect (because the modal con- 
nectives are not the original ones) but still modal axiomatization of this logic. Most 
of all, this axiomatization is finitary. 


Examples 


EXAMPLE 14 The formula OT is translated into [1] T ^ [2] T, which is not valid 
in K T5[1U 2]. Hence OT is not valid in inaccessible worlds logic. 


EXAMPLE 15 The formula O(p ^ q) — Op is translated into 

[1)(p ^ g) ^ [2]^(p ^ a) > [1]P ^ [2]7». 
In X T5[1U2], this formula is equivalent to the conjunction of (([1](pAq)^[2]^(p^ 
q)) ^ [1]p) and (([1](p ^ q) ^ [2] ^ (p ^ q)) > [2]-p). Now the first conjunct is a 
theorem of K T'5[1 U 2] (because of [1](p ^ q) > [1]p), but the second is not. 


EXAMPLE 16 The formula (Op A Oq) — (p «€ q) is translated into 
(Ep ^ [2]>p ^ [1]a ^ [2]^a) + (p + 4). 

In KT5[1U 2], the antecedent is equivalent to ([1](p © q) ^ [2](p © q), and now 
((1](p + 9) ^ [2)(p + a) > (p + a) 


is an instance of the T[1 U 2]-axiom. 
Proofs 
The proof has been first given in (Goranko [12]). 


LEMMA 17 Let Mj, = (W, R,m) be an inaccessible worlds model wherein a 
formula F is satisfied at wg. Let Ry = R and R5 = W? \ R. Then (W, Ri, Ro, 
m) isa KT5[1 U 2]-model satisfying T(F) at wo. 


Proof. First note that the K T'5[1U 2] axioms are true in (W, R1, R2) because Ry U 
Ro is an equivalence relation. Let M = (W, R1, Re,m). We prove that for every 
formula G and w € W, Mi,,w [-'" G iff M,w |= T(G) by induction on the 
structure of G. The only non-trivial case is 


e Miu, w EY OH 

e iff for all v € W, (v € R(w) iff Miw, v °! H) 

e iff for all v € W, (v € R(w) iff M,v }= T(H)), by induction hypothesis 
e iff for all v in W, 


— ifv € Rı(w) then M,v E- T(H) by construction of R;, and 
— ifv € Ro(w) then M,v E ^7(H), by construction of R2 
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e iff M,w E [17 (H) A [2]27T(H) 
e iff M,w | T(QH) 
E 


LEMMA 18 KT5[1U2] is characterized by the class of frames (W, R,, R2) where 
R, U Ro is universal (i.e. Ry U Ro = W?) and R; N Ro = Ø. 


Proof. As the converse is trivial, we only have to prove that if F is KT5[1 U 2]- 
satisfiable then it is also satisfiable in a frame (W, R4, R2) where R, U Rez is an 
equivalence relation and Rı N Rə = Ø. Suppose given a standard Kripke model 
M = (W, Ri, R5, m) such that M, wo = F for some wo € W. First, let W” be the 
connected part of W that contains wo, i.e. the set of worlds that can be reached from 
wo via R4 U R2. Let R}, R5, and m' be the respective restrictions of R1, R2, and m 
to W'. Itis well-known that M’ = (W', Rj, R5, m) is still a model of K T5[1U2], 
and M', wo E F. 

Now, we build a new model which will simulate (W’, Ri, R5, m’), but where the 
two relations will have an empty intersection. Let Vj and V2 be two sets such that 
Vi, V2 and W' are pairwise disjoint, Vi is isomorphic to W' via an isomorphism 
fı, and V2 is isomorphic to W’ via an isomorphism f2.(For example, V; could be 
W x {i}.)Let V denote V; U Vz, and let RY and R3 be relations on V as follows: 


1. RY = ((fi(v), fi(w)): w € W, i = 1,2 and (v, w) € R,jU 
{(fi(v), fa-i(w)): w € W, i = 1,2, (v,w) € Fa and (v,w) ¢ Ro} 

2. R! = {(fi(v), fi(w)): w € W, i = 1,2and (v, w) € R2}U 
{(fi(v), fa-;(w)):w € W, i = 1,2, (v,w) € Re and(v,w) ¢ Ri} 


Graphically, 
a a 
u U u U u U 
HE z4 telle a4 
Will give: 


filu) —— fi (v) fiu) SIZE » fi(v) fiu) ——> fie) 


E, 1 
n à E ^a 
fo(u) —— flv) falu) ee > fo(v) fo(u) ——— fa(v) 


Where dotted lines denote R2 and full lines R1. 
Finally, let m” be defined by: f;(w) € m"(F) iff w € m(F), for F a proposi- 
tional variable. 
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Then let o = f[1 U f7!. yis a pseudo-epimorphism from (V, RY, RY, m") onto 
(W', Ri, R5, m) (Hughes and Cresswell [14]). This ensures that F is satisfiable in 
(V, RY, RY, m"). Moreover, it can easily be shown that RY U R3 is an equivalence 
relation over V, and even that Ri’ U RY = V?, and that RY N RY = Ø. Hence the 
lemma is proved. a 


LEMMA 19 Let (W, R,, R2) be a frame for KT5[1 U 2] wherein T(F) is satisfi- 
able. Then there is an inaccessible worlds frame wherein F is satisfiable. 


Proof. The proof is similar to that in (Humberstone [15]). E 


Suppose M = (W, Rı, R2, m), and M, wo F T(F) for some wo € W. By the 
above Lemma 18, we can suppose that E; N Rz = Ø, and that Ry U Rz = W?, 
i. e. R2 is the complement of F4. 

Let Mi, = (W, R1,m). Clearly M; is an inaccessible worlds model. We prove 
by induction that for every w € W and every formula F, we have M,w f= 7(F) 
iff Miw, w E-*" F. The only non-trivial case is: F = OG, i.e. r(F) = [1]r(G) ^ 
[2]^7(G). 


e From the left to right, suppose M,w H [1]r(G) ^ [2]^7(G). Let v be any 
world from W. If v € Ry(w), as M,w = [1]r(G), we have that M,v E- 
T(G), and by induction hypothesis, M;i,, v E-*" G. Ifv € R,(w) then v € 
Ra3(w) (because Ry U Ro = W*?). As M,w E [2]-7(G), we have that 
M,v E ^7(G), and by induction hypothesis, M;,, v |°! ~G. Putting both 
together we get that Miw, w E^" OG. 


e From the right to the left, suppose Miw, w E^" OG. Then v is in Rj (w) iff 
M,v E?" G. By induction hypothesis, v € R4 (w) iff M,v E- T(G). Hence 
M,w E [1]|r(G). As R;,:nR2 = Ø, we have that M, w E- [2]-7(G). Putting 
both together we get that M, w E- [1]r(G) ^ [2]^7(G). 


REMARK 20 It is clear that this lemma cannot handle extensions of the basic in- 
accessible world logic e.g. with axiom D. The reason is that we cannot ensure that 
our construction of the inaccessible world model preserves the accessibility rela- 
tion properties. 


Now the theorem follows immediately from Lemma 17 and 19. 


4.4 Classical Modal Logics 
The Translation 


The translation 7 from classical modal logics into multi-modal logics combines 
the above translations of monotonic modal logics and Humberstone's inaccessible 
worlds logic. It goes as follows: 
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e T(F) = F if F is a propositional variable 
e T(GF) = (1)(]r(F) A [3]o7(F)) 


and homomorphic for the cases of the classical connectives. 
Then we have 


THEOREM 21 Fe F iff V x(1,2,3] T(F). 


Hence a formula F is valid in classical modal logic E iff r(F) is valid in normal 
multi-modal logic K[1, 2, 3]. 


REMARK 22 Note that at least for the basic classical modal logic E we are able 
to strengthen our translation to r(OF) = (1)([2]7(F) A [1]>7(F)). 


In the next theorem we extend our result to ET. We translate into K [1,2,3] + 
Bı 2, where B 9 is the axiom (1)|2]G — G. Bı,2 axiomatizes a condition on 
frames that we call bj ? : Rı C Rz. 


THEOREM 23 Het F iff V kp,2,3]- B1. T(F). 


Note that axiom B 2 is in fact the monotonic modal logic translation of the T- 
axiom. 
The generalization towards extensions of E by other axioms such as D,4, B, T 
seems to be much more difficult. This is due to the fact that the monotonic trans- 
lations of these axioms do not completely axiomatize the translated models (see 
Lemma 25 in the proofs). On the other hand, the classical translation of e.g. ax- 
iom T yielding (1)([2]G ^ [312G) — G would also be a candidate for the target 
logic axioms’, but it is difficult to devise a semantics for such complex multimodal 
axioms. (Note nevertheless that we would only need a soundness result for the mul- 
timodal axiom). 


Examples 


EXAMPLE 24 The formula OT is translated into 

(1)([]T ^ [3] T). 
which in K[1,2,3] is equivalent to (1)[3].L. This is clearly not a theorem of 
K{1, 2, 3]. Hence OT is not valid in classical modal logic. 


"and not (i): (1) ([2]r(G)A[3]^7 (G)) > 7(G) as a particular instance of T is OG — G, 
for G propositional variable, whose translation is (1)([2]G ^ [3]2G — G which give (by 
substitution) (1)([2]G ^ [3]2G) — G for any formula G. 
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Proofs 


LEMMA 25 Let Mmin = (W, N,m) be a minimal model. 
Let M = (V, Ri, R2, R3, m) be a model such that 


weW 


e Rı(w) = N(w), for every v € W, 
e R (U) =U, for every w € W and U € N(w), 
e R3(U) = W —U, for every w € W andU € N(w), 
Then for every formula F and world w € W, Mmin, w E""^ F if M, w E r(F). 


Proof. First, M is a model of K[1, 2, 3]. Then the proof is straightforward by induc- 
tion on F'. The only non-trivial case is that of F = OG. We have: Mmin, w E77 
DG 


iff there is U € N(w) such that for every v € W, (v € U iff Mii, 
v pmi G) 


iff (by induction hypothesis) there is U € N(w) s. th. for every v c W, 
(v € U iff M,v = T(G)) 


iff there isa U € N(w) s. th. for every v c W 
— ifv € R3(U) then M,v |= T(G) (by the definition of R2), and 
— if v € R3(U) then M,v E- ^7T(G)) (by the definition of R3) 
iff there is U € Ri (w) such that M,U E [2]r(G) ^ [3]-7(G) 
iff M,w E T(F). 


LEMMA 26 (Chellas [2], p. 261) If Fet F then -gr F. 


The next lemma is proven syntactically, because we can state it thus in a general 
way for extensions of E by any axiom A. 


LEMMA 27 Let A be any axiom schema. If- 44 F then p,2,3] 4c (4) T(F). 


Proof. We use induction on the proof of F: Whenever the axiom .A is used, we 
can replace it by r(.A). The other axioms are classical. Whenever the non-normal 
inference rule RE: 


FoG 
OF o OG 
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is used, we can replace it by 


1(F) + 7(G) 
adi) A [3]>7(F)) e (1)([2]7(G) ^ [3]>7(G)) 


which is a derived inference rule of K[1, 2, 3] (by substitution of equivalences). 8 
LEMMA 28 (Catach [1]) U k(,2,3] B, F implies Fit 2,3]401.2 F. 
THEOREM 29 F- F iff x11,2,3) T(F). 


Proof. The proof follows from Lemmas 25 and 27, using the completeness of E 
(Lemma 26). E 


THEOREM 30 Het F iff - x(1,2,3]+ B12 T(F). 


Proof. From the left to the right, the proof follows from Lemmas 27 and 26. 
From the right to the left, the proof is semantical: Given a minimal model, we 
must warrant that the normal model that we construct in Lemma 25 is a model for 
r(T) = ((1)(2]r(F) ^ [3]-7(F))) > F. 
Now the translated model of Lemma 25 already satisfies the required property. ? 
Then we take advantage of the soundness of the normal modal logic K [1, 2, 3] + 
Bi 2. 
a 


5 CONCLUSION 


We have given a translation from classical modal logics into normal modal logics. 
For particular classes of non-normal modal logics we have given specialised trans- 
lations. Precisely, we have proved the exactness of the translation for the following 
logics: 


e the basic classical modal logic E, 

e the basic monotonic modal logic EM, 

e the basic regular modal logic EMC, 

e Humberstone’s inaccessible worlds logic. 


We have also given exactness proofs for the extension of E with axiom T and 
for extensions of EM with axioms 


D: OF + OF 


5Note that this is not always the case (a simple example being the axiom © T). This makes 
it difficult to prove the exactness of other extension of E e.g. by axioms 5 or B. 
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T: OF > F 
4: OF > OOF 


Moreover we have proved exacteness of the translation for any extension of the 
basic regular modal logic EMC. 
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